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Preface 

The  chapters  in  this  volume  were  presented  at  the  July  2000  NATO  Advanced  Study 
Institute  on  Twentieth  Century  Harmonic  Analysis  -  a  Celebration.  The  conference  was  held 
at  the  beautiful  II  Ciocco  resort  near  Lucca,  in  the  glorious  Tuscany  region  of  northern  Italy. 
Once  again  we  gathered  at  this  idyllic  spot  to  explore  and  extend  the  reciprocity  between 
mathematics  and  engineering.  The  dynamic  interaction  between  world-renowned  scientists 
from  the  usually  disparate  communities  of  pure  mathematicians  and  applied  scientists,  which 
occurred  at  our  1989, 1991, 1992,  and  1998  ASPs,  continued  at  this  meeting. 

Almost  exactly  one  century  ago  harmonic  analysis  entered  a  (still  continuing)  golden 
age,  with  the  emergence  of  many  great  masters  throughout  Europe.  Some  of  these  illustrious 
names  were:  Hardy,  Littlewood,  Landau,  van  der  Corput,  Hadamard,  de  la  Vallee-Poussin, 
Tchebychev,  Bernstein,  Markov,  Fej^r,  Riesz,  Polya,  Szego,  etc.  They  created  a  wealth  of 
profound  analytic  methods  which  were  so  successfully  exploited  and  further  developed  by 
succeeding  generations.  This  flourishing  of  concrete  harmonic  analysis  is,  today,  as  lively 
as  ever,  with  such  contemporary  great  analysts  as  ASI  lecturers  and  authors  Havin,  Kahane, 
Shapiro,  Weiss,  Salem  Prize  winner  Komer,  and  Nobel  Laureate  Hauptman. 

In  addition  to  its  own  ongoing  internal  development  and  its  basic  role  in  other  areas  of 
mathematics  (number  theory,  differential  equations,  probability,  statistics),  physics  and  chem¬ 
istry  (visible  light  and  infrared  optics,  crystallography,  wave  phenomena),  financial  analysis 
(time  series),  medicine  (tomography,  brain  and  heart  wave  analysis),  and  biological  signal 
processing,  harmonic  analysis  has  made  fundamental  contributions  to  essentially  all  twenti¬ 
eth  century  technology-based  human  endeavors.  This  includes  telephone,  radio,  television, 
radar,  sonar,  satellite  communications,  medical  imaging,  the  internet,  and  multimedia.  This 
ubiquitous  nature  of  the  subject  is  touched  upon  by  many  of  the  authors. 

Thus,  the  ASI  and  this  volume  are  intended  not  only  to  promote  the  infusion  of  new  math¬ 
ematical  tools  into  applied  harmonic  analysis,  but  also  to  fuel  the  development  of  applied 
mathematics  by  providing  opportunities  for  young  engineers,  mathematicians,  and  other  sci¬ 
entists  to  learn  more  about  problem  areas  in  today’s  technology  that  might  benefit  from  new 
mathematical  insights. 

Many  of  the  world’s  harmonic  analysis  experts  were  principal  speakers  at  the  ASI,  and 
their  chapters  appear  in  this  volume.  These  renowned  scientists  address  their  talks  and  chap¬ 
ters  to  an  audience  which  consists  of  a  broad  spectrum  of  pure  and  applied  mathematicians, 
as  well  as  a  diverse  group  of  engineers  and  scientists.  Thus,  the  reader  has  the  opportunity  to 


leam  or  reinforce  fundamental  concepts  from  the  individuals  who  have  played  a  major  role 
in  the  ongoing  flourishing  of  harmonic  analysis,  and  to  see  them  discuss  in  accessible  terms 
their  profound  contributions  and  ideas  for  future  research. 

Victor  Havin  begins  this  book  by  clearly  describing,  both  heuristically  and  precisely, 
how  one  should  view  “smallness”  in  the  context  of  the  uncertainty  principle.  His  chapter 
gives  the  reader  a  fundamental  understanding  of  one  of  the  major  themes  of  classical  Fourier 
analysis,  and  also  discusses  important  applications  of  the  uncertainty  principle  to  physics  and 
engineering. 

Harold  Shapiro  beautifully  shows  how  operator  theory,  involving  genuinely  infinite-di¬ 
mensional  constructions,  has  profound  applications  to  concrete  classical  harmonic  analysis 
problems.  A  wonderful  case  in  point  is  D.  Sarason’s  recognition  of  the  fundamental  role 
played  by  commutativity  in  understanding  (and  getting  radically  new  proofs  of)  classical 
interpolation  and  moment  problems,  like  those  usually  associated  with  the  names  of  Pick- 
Nevanlinna,  Caratheodory,  and  Schur. 

Jean-Pierre  Kahane  offers  clear  and  concise  insight  into  the  interactions  between  Baire’s 
category  theorem,  Lebesgue’s  measure  theory,  and  trigonometric  series. 

A  beautiful  exposition  of  the  current  state  of  the  art  in  Gabor  theory  is  presented  by  Guido 
Janssen.  He  details  the  key  role  played  by  the  Gabor  frame  operator  associated  with  the  set 
of  elementary  signals  being  used  in  the  expansion  of  a  given  signal.  Working  in  the  time, 
frequency,  time-frequency,  and  Zak  transform  domains,  he  addresses  the  basic  problems  of 
whether  there  is  a  Gabor  frame  and  how  to  compute  the  dual  frame. 

While  describing  the  many  fascinating  twists  and  turns  that  can  occur  when  rearranging 
orthogonal  series,  Tom  Komer  clarifies  the  surprising  phenomenon  that  wavelet  expansions 
continue  to  work  well  under  decreasing  rearrangements  whereas  Fourier  series  do  not.  His 
focus  is  on  the  work  of  Olevskii,  Tao,  and  others  in  this  area. 

Stephane  Jaffard  clearly  describes  the  interplay  between  function  spaces,  wavelet  expan¬ 
sions,  and  multifractal  analysis.  For  example,  he  shows  how  refinements  of  the  numerical 
techniques  introduced  to  compute  turbulence  spectra  have  led  to  the  introduction  of  new 
function  spaces,  which  turn  out  to  be  the  right  setting  to  determine  the  fractal  dimensions  of 
graphs,  and  offer  natural  extensions  of  the  Besov  spaces  to  negative  p’s. 

Richard  Tolimieri  and  Myoung  An  offer  significant  insight  into  the  emergence  at  IBM 
of  Fourier  transform  algorithms,  a  critical  step  in  advancing  the  widespread  use  of  digital 
computers  in  scientific  and  technological  applications.  They  show  how  early  efforts  focused 
on  reducing  “expensive”  multiplications  at  the  cost  of  increasing  additions,  and  how  the  re¬ 
cent  importance  of  FPGA’s  and  reconfigurable  hardware  has  renewed  the  need  to  reduce 
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multiplication  counts.  Throughout  they  weave  in  the  interactions  between  these  fascinating 
computational  techniques  and  harmonic  analysis. 

Herb  Hauptman  describes  an  application  of  harmonic  analysis  which  is  yielding  profound 
medical  benefits  daily,  and  for  which  he  won  the  1985  Nobel  Prize  in  chemistry.  Namely,  he 
shows  how  the  known  atomicity  of  crystal  structures  and  the  redundancy  of  the  observed  mag¬ 
nitudes  of  the  normalized  structure  factors  of  the  X-ray  diffraction  pattern  render  the  classical 
phase  problem  of  X-ray  crystallography  solvable.  He  goes  on  to  describe  his  new  “Shake- 
and-Bake”  algorithm,  a  completely  automatic  solution  of  the  phase  problem  for  structures 
containing  as  many  as  1000  atoms  when  data  are  available  to  atomic  resolution. 

By  combining  the  concept  of  frames  with  the  Zak  transform.  Josh  Zeevi  elucidates  the  use 
of  localized  bases  or  frames  in  the  representation,  processing,  compression  and  transmission 
of  speech,  images  and  other  natural  nonstationary  signals. 

Babar  Saffari  offers  a  fascinating  chapter  on  a  fascinating  subject:  extremal  problems 
involving  polynomials,  trigonometric  polynomials,  and  exponential  sums.  The  depth  and 
beauty  of  this  “elementary”  subject  come  alive  in  his  exposition. 

By  considering  function  algebras,  formal  power  series,  and  operator  algebras,  including  a 
quantitative  treatment  of  the  Weiner-Pitt-Sreider  phenomenon  for  convolution  measure  alge¬ 
bras  on  locally  compact  abelian  groups,  Nikolai  Nikolski  gives  an  in-depth  survey  of  recent 
results  on  the  phenomenon  of  the  “invisible  spectrum”  for  Banach  algebras. 

As  Hugh  Montgomery  makes  clear,  a  wide  variety  of  questions  of  harmonic  analysis 
arise  naturally  in  various  contexts  of  analytic  number  theory.  In  his  chapter,  Hugh  gives  a 
clear  exposition  of  a  number  of  examples  of  this  type. 

By  focusing  on  two  major  and  central  mathematical  areas,  the  radar  ambiguity  func¬ 
tion  and  radar  waveform  design.  Bill  Moran  beautifully  brings  forth  the  intimate  relationship 
between  harmonic  analysis  and  signal  processing.  It  is  difficult  to  imagine  a  more  perfect 
exposition  for  the  mathematician  interested  in  real-world  signal  design  and  analysis. 

In  the  past  20  years  or  so,  wavelets  have  emerged  as  one  of  the  central  topics  in  both  pure 
and  applied  harmonic  analysis.  Guido  Weiss  and  Edward  Wilson  present  a  broad  overview 
of  many  aspects  of  the  underlying  mathematical  theory.  A  highlight  is  their  recently  obtained 
characterization  of  “all”  wavelets  -  an  important  new  result. 

The  concluding  chapter  consists  of  several  lesser  known  (as  compared,  for  example,  to  the 
Riemann  hypothesis)  but  certainly  worthwhile  unsolved  problems  and  conjectures  of  current 
appeal  in  harmonic  analysis,  to  challenge  the  interested  twenty-first  century  reader. 

Numerous  giants  in  the  field,  including  about  half  of  the  lecturers  and  authors,  have  re¬ 
cently  retired  or  will  do  so  in  the  near  future.  A  major  purpose  of  the  ASI  was  to  afford  them 
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the  opportunity  to  join  together  to  share  their  profound  wisdom  with  the  many  future  stars  of 
pure  and  applied  harmonic  analysis.  A  second  purpose  was  to  produce  this  book  for  current 
and  future  generations,  highlighting  their  thoughts  and  insights. 

The  cooperation  of  many  individuals  and  organizations  was  required  in  order  to  make  the 
conference  the  success  that  it  was.  First  and  foremost  I  wish  to  thank  NATO,  and  especially 
Dr.  F.  Pedrazzini  and  his  staff,  for  the  initial  grant  and  subsequent  help.  Financial  support 
was  also  received  from  the  Air  Force  Office  of  Scientific  Research  (Dr.  Jon  Sjogren),  the  Eu¬ 
ropean  office  of  the  Office  of  Naval  Research  (Dr.  Igor  Vodyanoy),  the  Raytheon  Company 
(Dr.  Philip  W.  Cheney,  Chief  Scientist),  the  National  Science  Foundation,  the  Australian  Re¬ 
search  Council,  Philips  Research  Laboratories,  the  European  Mathematical  Society,  the  Uni¬ 
versity  of  Massachusetts  at  Boston,  and  Prometheus  Inc.  This  additional  support  is  gratefully 
acknowledged. 

I  wish  to  express  my  sincere  appreciation  to  my  assistants,  Marcia  Byrnes  and  Gerald  Os- 
theimer,  for  their  invaluable  aid.  I  am  also  grateful  to  Kathryn  Hargreaves,  our  T^nician,  for 
her  superlative  work  in  preparing  this  volume.  Finally,  my  heartfelt  thanks  to  the  II  Ciocco 
staff,  especially  Bruno  Giannasi  and  Alberto  Sufffedini,  for  offering  an  ideal  setting,  not  to 
mention  the  magnificent  meals,  that  promoted  the  productive  interaction  between  the  partic¬ 
ipants  of  the  conference.  All  of  the  above,  the  other  speakers,  and  the  remaining  conferees, 
made  it  possible  for  our  Advanced  Study  Institute,  and  this  volume,  to  fulfill  the  stated  NATO 
objectives  of  disseminating  advanced  knowledge  and  fostering  international  scientific  con¬ 
tacts. 

January  14, 2001 


Jim  Byrnes,  Newport,  Rhode  Island 
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The  Papers 


On  the  Uncertainty  Principle  in  Harmonic 

Analysis 


V.R  Havin 

Department  of  Mathematics  and  Mechanics 
St.  Petersburg  State  University 
Stary  Peterhof 

St.  Petersburg  198904  Russia 
havin@havin .nsr.pu.ru 

Abstract.  The  Uncertainty  Principle  (UP)  as  understood  in  this  lecture  is  the  fol¬ 
lowing  informal  assertion:  a  non-zero  "object"  (a  function,  distribution,  hyperfunc- 
tion)  and  its  Fourier  image  cannot  be  too  small  simultaneously.  “The  smallness”  is 
understood  in  a  very  broad  sense  meaning  fast  decay  (at  infinity  or  at  a  point,  bilateral 
or  unilateral),  perforated  (or  bounded,  or  semibounded)  support  etc.  The  UP  becomes 
a  theorem  for  many  “smallnesses”  and  has  a  multitude  of  quite  concrete  quantitative 
forms.  It  plays  a  fundamental  role  as  one  of  the  major  themes  of  classical  Fourier 
analysis  (and  neighboring  parts  of  analysis),  but  also  in  applications  to  physics  and 
engineering.  The  lecture  is  a  review  of  facts  and  techniques  related  to  the  UP;  connec¬ 
tions  with  local  and  non-local  shift  invariant  operators  are  discussed  at  the  end  of  the 
lecture  (including  some  topical  problems  of  potential  theory).  The  lecture  is  intended 
for  the  general  audience  acquainted  with  basic  facts  of  Fourier  analysis  on  the  line  and 
circle,  and  rudiments  of  complex  analysis. 

Introduction 

This  lecture  is  devoted  to  the  following  phenomenon  known  as  the  Uncertainty  Principle 
(the  UP): 

it  is  impossible  for  a  non-zero  function  f  and  its  Fourier  image  f  to  be  too  small  si¬ 
multaneously.  In  other  words,  the  approximate  equalities  /  ^  g,f  ^  g  cannot  hold  at  the 
same  time  and  with  a  high  degree  of  accuracy  unless  /  and  g  are  identical.  Gaining  some 
“certainty”  about  /  (in  the  form  of  a  good  approximation  g)  we  have  to  pay  by  the  uncer¬ 
tainty  about  /,  since  the  error  /  —  p  is  bound  to  be  considerable.  The  term  is  borrowed 
from  quantum  mechanics  where  it  is  usually  understood  as  the  Heisenberg  inequality  for 

3 
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the  wave  function,  but  in  the  present  text  it  is  interpreted  in  a  much  less  definite  sense;  this 
very  vagueness  makes  it  flexible  and  susceptible  to  a  multitude  of  rigorous  interpretations  (or 
refutations)  depending  on  a  concrete  kind  of  respective  “smallness”  of  /  and  /  mentioned  in 
its  statement.  Our  UP  can  be  patently  wrong  (e.g.  if  the  sizes  of  /  and  /  are  measured  in 
the  L^-norm  and  in  many  other  cases);  this  means  the  UP  can  be  sometimes  overcome,  and 
“small”  non-zero  pairs  (/,  /)  may  exist,  this  fact  being  also  one  of  our  themes.  Nevertheless 
the  UP  plays  an  outstanding  role  in  harmonic  analysis  and  its  applications  to  physics  and  en¬ 
gineering.  But  these  applications  won’t  be  discussed  here.  We  treat  the  UP  as  a  phenomenon 
of  pure  mathematics,  or,  to  be  more  precise,  classical  Fourier  analysis  (mainly  on  R  and  the 
unit  circle  T).  Our  theme  is  very  vast  and  can  be  looked  at  from  many  points  of  view;  ours 
will  be  that  of  pure  analysis.  The  concrete  forms  of  the  UP  to  be  considered  here  pertain 
mostly  to  quasianalyticity,  approximation  theory,  and,  first  of  all,  to  complex  analysis,  an 
abundant  source  of  concrete  manifestations  (and  disprovements)  of  the  UR  Among  the  omis¬ 
sions  of  this  lecture  are  the  operator  theoretic  approach  to  the  UP  (commutation  relations) 
and  the  modem  time-frequency  approach.  But  even  after  we  have  confined  our  discussion 
to  the  purely  analytic  aspects  of  the  phenomenon  we  are  still  left  with  a  huge  mass  of  facts, 
techniques,  and  approaches.  Thus  the  choice  of  what  is  to  be  discussed  was  inescapable  and 
difficult.  It  was  a  compromise  dictated  by  what  I  know  (or  don’t),  time  and  size  limitations, 
my  personal  predilections,  but  also  by  my  desire  to  publicize  impressive  results  obtained  in 
the  eighties  and  nineties  by  my  (partly  former)  colleagues  from  St.  Petersburg,  although  a 
good  deal  of  the  subsequent  text  is  quite  old  and  classical. 

To  describe  the  organization  of  the  lecture  let  us  first  introduce  some  notation.  Let  X  de¬ 
note  R  or  T  =  {^  €  C  :  =  1};  m  will  stand  for  Lebesgue  measure  on  X;  we  always  nor¬ 

malize  m  on  T  :  m(T)  1;  sometimes  we  write  \  A\  in  place  of  m{A).  The  Fourier  transform 
/  of  a  function  f  e  =  L^(X,  m)  is  understood  as  /(^)  =  (27r)“^  f  {t)  exp{-it()dt 
where  ^  €  R  or  Z  (i.e.  ^  6  X),  but  different  normalizations  of  /  can  also  occur  here  and 
there.  We  assume  the  reader  is  acquainted  with  Fourier  analysis  of  (tempered)  distributions. 
In  particular  our  most  frequent  symbols  related  to  a  distribution  T  on  X  will  be  supp  T  {= 
the  closed  support  ofT)  and  spec  T  =  supp  T,  the  spectrum  of  T.  Now  we  try  to  bring  some 
order  into  the  heap  of  “smallnesses”  to  be  used  below  (see  the  statement  of  the  UP  above). 
We  compose  a  list  of  properties  of  a  function  (or  distribution)  on  X : 

Si  (/)  (“fast  bilateral  decay  of  /  at  infinity”);  /  is  defined  on  R  or  Z  and  satisfies 

f{t)  =  0{M{t)),  |i|  -1-00 

where  M  is  a  given  majorant,  lim|f|_^+oo  M{t)  =0: 

S2{f):  replacing  \t\  -l-oo  in  Si(f)  hyt-^  +oo  (or  t  ~oo)  we  get  fast  decay  at  -oo 
(or  -l-oo); 

^s{f)  (“^  deep  zero  of  /  at  a  point  Xq  G  X”):  f{t)  =  0{M(t))^t  rco,  where 
\imt-^xo  =  0; 
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S^{T)  (“a  sparse  support”)  :  X  \  suppT  is  non-empty  and  more  or  less  “rich”  (say, 
consists  of  many  long  intervals  or  arcs), 

but  very  interesting  forms  of  the  UP  arise  even  when  X  \  supp  T  is  connected,  this  is  why 
the  following  three  properties  are  stated  separately: 

iS5(T)  (“a  gap  in  the  support”):  suppT  omits  a  non-degenerate  interval  (or  an  arc)  of  X; 
SeiT)  (“a  bounded  support”):  supp  T  is  bounded  (this  case  refers  to  X  =  R); 

Se{f)  means  that  T  is  “band  limited”. 

57(T)  (“a  sembounded  support”):  suppT  c  [0,  +00)  (or  c  (-00, 0]);  the  spectral  prop¬ 
erty  57 (T)  is  absolutely  fundamental  for  our  subject  and  deserves  special  attention. 

To  conclude  this  list  we  have  to  define  the  so-called  logarithmic  integral  C(f)  of  a  func¬ 
tion  /  defined  on  X : 

£(/)  =  j  ]og\f\dm,ifX  =  T,C{f)  =  j  log|/|dn,ifX=:R 

T  R 

where  11  is  the  Poisson  measure  7r“^(l  4-  Our  last  “smallness  condition”  means 

the  geometric  mean  of  /  (w.r.  to  m  on  T  or  11  on  R)  is  zero;  it  looks  less  natural  than  its 
predecessors,  but  in  fact  it  is  responsible  for  many  variants  of  the  UP,  and  is  omnipresent  in 
many  books  on  Fourier  analysis,  complex  analysis  and  probability: 

Seif)  ■■  C{f)  =  -00. 

A  discrete  logarithmic  integral  C{f)  of  a  function  defined  on  Z  will  also  play  a  role:  this  time 

m  =  E 

nez 

Now  we  are  in  a  position  to  specify  (slightly)  our  main  question: 
given  j  and  k  =  1, 2, . . , ,  7  or  £,  is  it  true  that  Sj{T)  and  Sk{T)  imply  the  complete 
vanishing  ofTl 

The  conditions  Sj  being  still  vague,  the  answer  to  such  an  “(5^  ,  5^) -question”  depends 
heavily  on  the  concrete  relations  between  the  time  condition  Sj  (/)  and  the  frequency  condi¬ 
tion  Sk{f)\  some  combinations  of  j  and  k  may  even  not  admit  a  satisfactory  answer  at  all. 
But  a  remarkable  fact  is  the  existence  of  good  answers  to  many  such  questions,  the  answers 
being  sharp  and  verifiable.  The  list  of  the  S'* -conditions  looks  dull  (something  like  bookkeep¬ 
ing)  and  formal,  but  the  diversity,  variety  and  beauty  of  the  ideas  and  tools  required  to  answer 
at  least  some  of  our  questions  is  quite  amazing.  Note  that  “qualitative”  questions  [Sk^Sj) 
usually  entail  some  “quantitative”  problems  resulting  in  useful  and  explicit  estimates. 

The  lecture  consists  of  three  parts.  Part  1  is  a  collection  of  results  not  requiring  any 
use  of  complex  analyticity;  we  tiy  to  sketch  (or  at  least  allude  to)  some  proofs  (when  they 
are  simple).  This  policy  becomes  almost  impossible  in  Part  2  based  on  complex  analyticity. 
The  “complex”  proofs  usually  involve  a  good  portion  of  hard  analysis,  so  Part  2  is  mainly 
a  collection  of  results  accompanied  by  some  comments  (kind  of  a  guided  tour).  Part  3  is 
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devoted  to  some  remote  repercussions  of  the  UP:  description  of  symbols  of  local  and  non¬ 
local  shift  invariant  operators;  some  closely  related  topical  problems  stemming  from  potential 
theory  are  also  discussed. 

Our  theme  is  present  more  or  less  explicitly  in  any  course  on  Fourier  analysis  (  [Z,  Ba, 
Katz,Kah]),  quasianalyticity  ( [M]),  and  complex  analysis  ( [Bo,Du,Pr,Ho,Ga,Kool]).  The 
books  [PW],  [B],  [L],  [Lev],  [KahS],  [DeBr],  [Koo2],  [Koo3],  [Car],  [Carl],  and  [Nik]  are 
especially  close  to  our  theme  and  have  influenced  our  exposition  in  many  ways. 

The  present  lecture  is  mainly  based  on  [HJ]  just  reproducing  some  of  its  parts  in  a  very 
compressed  form;  we  often  refer  to  the  bibliography  therein.  I  also  want  to  mention  the  long 
article  [Na]  with  its  impressive  amount  of  excellent  results. 

1.  The  UP  without  Complex  Variables 
1.1.  On  functions  with  semibounded  spectra 

Let  L  be  a  linear  shift  invariant  operator  on  the  time  line  R  defined  on  a  vector  space 
of  functions  (or  distributions).  The  generic  form  of  L  is  the  convolution  with  a  function 
(distribution)  a: 


Lf  =  a*f]  a  =  L{6). 

We  may  interpret  L  as  a  device  transforming  inputs  /  into  outputs  Lf.  We  say  L  obeys 
the  causality  principle  (or  is  causal)  if  L/|(— oo,  io)  for  any  given  moment  to  depends  only 
on  /|(— oo,  fo)  (“no  output  without  an  input”),  or  what  is  the  same  a|(— oo,  0)  =  0.  In  the 
Fourier  coordinates  the  action  of  L  becomes 


Lf  =  d^f 

which  means  Lf  is  a  frequency  filter.  The  causality  imposes  severe  restrictions  on  the  spec¬ 
tral  characteristic  d  of  L  :  a  cannot  suppress  too  many  frequencies  unless  L  =  0.  This 
phenomenon  stems  from  the  analytic  continuability  of  d  into  a  half-plane  (due  to  the  semi¬ 
boundedness  of  supp  a).  This  complex  variable  explanation  will  be  one  of  the  themes  of  Part 
2.  Actually  one  can  explain  many  properties  of  d  in  the  causal  case  staying  on  the  line  and 
ignoring  the  existence  of  the  complex  plane.  One  of  these  properties  is  the  Jensen  inequality 
for  plus-functions  (i.e.  for  functions  with  positive  spectra). 

It  is  convenient  to  interchange  time  and  frequency  lines  and  concentrate  on  the  objects 
with  semibounded  spectra  (rather  than  supports).  We  start  with  the  periodic  case:  suppose 
/  G  i/(n,m)  =  i/P(n),  1  ^  p  ^  -1-00,  so  that  /  lives  on  Z  :  f{n)  =  f  z”dm,  n  6  Z. 
We  say  that  /  is  in  the  Hardy  class  //^(T)  if  its  spectrum  is  non-negative:  spec/  C  Z+ 
(sometimes  we  call  such  an  /  a  plus-function). 

For  a  probability  measure  p  in  a  measure  space  X  we  put 
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A^(f)(=A{f))  =  Jlfldix,  ff^(/)(=e(/))  =exp yiogl/ld/i, 

X  X 

thus  defining  the  arithmetic  and  geometric  means  of  /.  Note  that  the  meaning  of  the  two 
“smallnesses” 


A(f)  =  0  and  g{f)  =  C(f)  =  j \og\f\dn  =  -oo) 

X 

is  very  different:  the  first  just  means  /  =  0  a.e.  whereas  the  second  is  implied  by  fjL{{f  — 
0})  >  0  or  (depending  on  p)  by  a  fast  decay  of  /  at  a  point  of  X.  By  the  Jensen  inequality 
for  the  means  we  always  have 

(1)  Q{f)  <  A(f). 

Let  us  now  go  back  to  X  =  11,  jjl  =  m.  Clearly,  |/(0)|  =  |  f  /c?m|  <  >A(f)  for  any 
/  e  But  another  Jensen  inequality  asserts  that 

(2)  l/(0)Ka(/).  if  feH^{T) 

This  crucial  fact  has  many  far-reaching  implications  pertaining  to  the  UP.  So,  for  example, 

/  G  H\T)  &  Gif)  =  0=>f  =  0 
(Gif)  =  0  kills  /(O),  but  then  it  kills  any  f(n)).  But  then 

&  |{/=.o}|>o^/  =  o 

(total  absence  of  negative  frequencies  is  not  compatible  with  vanishing  on  a  set  of  positive 
length).  For  a  proof  of  (2)  see,  e.g.,  [HJ],  p.  34;  it  is  quite  short  and  elementary. 

By  a  Mobius  change  of  variables  we  get  the  following  version  of  (2):  If  f  E  L^(E)  and 
spec  /  C  [0,  H-oo),  then 


(3)  I  j  fdU\  <  Qnif) 

R 

if  is  regarded  here  as  a  tempered  distribution,  so  /  and  spec  /  make  sense;  11  is  the  Poisson 
measure,  see  the  Introduction).  In  particular  (3)  is  valid  for  any  plus-function  /  6  Lp(E) 
(w.r.  to  m),  1  <  p  ^  -i-oo  (i.e.  if  /  €  if^(R)  =  {/  €  //(R)  :  spec  /  C  [0,  -hoo)}.  It  is  easy 
to  deduce  from  (3)  that  /  G  iJ^(R)  &  Gnif)  =  0  /  =  0  so  that  if  /  G  /f^(R),  /  ^  0, 
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then  it  cannot  decay  too  fast  at  -foo  or  —oo  or  at  any  finite  point,  and  it  cannot  vanish  on  a 
set  of  positive  length.  This  last  property  is  stable  in  the  following  sense: 

The  Theorem  on  two  Constants.  For  any  f  e  and  any  Lebesgue  measur¬ 

able  S  cR 

(4)  l(/*n)(x)|  <  e  R 

5'  =  R\5,n^(£;)  =  n(£:-x)). 

The  proof  is  a  straightforward  combination  of  two  Jensen  inequalities  (1)  and  (3).  Note 
that  na:(5)  is  the  angle  (divided  by  tt)  under  which  S  is  seen  from  and  f*llx  =  P(f)(x) 
is  the  Poisson  integral  of  /  (i.e.  the  bounded  solution  of  the  Dirichlet  problem  for  the  upper 
half-plane  €+  with  the  boundary  function  /)  computed  at  a:  +  Clearly,  P{f)  =  0  implies 
/  =  0;  (4)  shows  that  if  a  plus-function  /  is  globally  bounded  (say,  |/|  ^  1)  and  very  small 
on  S'  (say,  |/|  ^  5),  then  P(f)  is  small  globally:  |P(/)(a;)|  ^  for  any  real  x. 

We  will  also  need  an  integral  version  of  this  result:  suppose  7  >  0,  and  nx(5)  ^  7 /or 
any  x  eR;  if  f  £  H^(R),  then 

(4')  I \P{f)fdm  <2(1  |/|^dm)^||/||f 

R  S 

where  ||  II2  denotes  the  L^{m)-norm  ( [HJ],  p.  40). 

The  logarithmic  integral  C^(f)  figuring  in  ^^(/),  C^{f)  =  f  log  \f\dp  ^  log^p(/)  for 

X 

X  =  U,  p  =  m  or  X  =  E,  p  —  U  plays  an  outstanding  role  in  many  problems  concerning 
the  UP  (not  only  for  semibounded  spectra!).  The  two  conditions 

£(/)  =  —00  and  £(/)  >  -00 

define  two  separate  realms:  in  the  first  one  the  rule  of  the  UP  is  indisputable  whereas  in  the 
second  it  can  be  sometimes  resisted  (see  [HJ],  but  especially  [Koo2]). 

1.2.  Hilbert  Space  methods 

1.2.1.  Annihilating  pairs  of  sets.  For  a  function  f  G  the  set  {x  G  :  f(x)  / 

0}  is  called  the  essential  support  of  f  and  denoted  by  esssupp  /;  it  is  defined  up  to  a  set  of 
zero  Lebesgue  measure.  The  essential  spectrum  of  f  is  defined  as  ess  supp  /  and  denoted  by 
ess  spec  /  (/  is  understood  in  accordance  with  the  Plancherel  theorem), 

A  pair  (5,  T.)  of  Lebesgue  measurable  sets  in  R^^  is  said  to  be  annihilating  (or  a-pair)  if 

/  G  ess  supp  f  C  S,  ess  spec  /  C  L  =>  /  =  0. 


(5) 
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The  following  property  of  (5,  E)  is  more  interesting:  we  say  that  (5,  S)  is  a  strong  a-pair  if 

(6)  yi/p<c(s,E)(yi/p+|i/T) 

Rd  S'  S' 

for  any  f  £  L?'  {A*  denotes  \^).  The  annihilation  property  (5)  of  a  strong  a-pair  is  “stable”: 
(5)  only  means  that  vanishing  of  /|5'  and  /|E'  implies  global  vanishing  of  /  whereas  (6)  says 
that  the  smallness  of  /|<S",  /|E'  implies  the  global  smallness  off. 

The  d-dimensional  Lebesgue  measure  of  a  set  A  C  E'^  will  be  denoted  by  |  A|. 

The  following  version  of  the  UP  can  be  proved  using  only  the  basic  properties  of  the 
Fourier  transform  and  very  general  properties  of  projectors  in  a  Hilbert  space: 

The  Amrein-Berthier  theorem..  If 

jiS|  -}-  jE|  <  -f-CXD, 

then  (S^T,)  is  a  strong  a-pair. 

Note  that  the  sets  5,  supposed  to  be  bounded.  We  are  going  to  sketch  a  proof 

based  on  two  orthogonal  projectors  P5,  Ps  of 

Psf  =  Xsf,Hhf)^xJ 

where  xa  denotes  the  characteristic  function  of  the  set  A  C  E*^  and  T  is  the  Fourier  transform 
in  E'^  (duly  normalized  to  define  a  unitary  operator  in  I?).  The  proof  is  sketched  in  2.3  after 
some  preparation  in  2.2. 

1.2.2.  Positive  angle  between  two  subspaces.  Let  us  now  forget  the  concrete  nature  of  these 
projectors  and  move  to  an  abstract  Hilbert  space  H\  let  (M,  N)  be  a  pair  of  its  closed  sub¬ 
spaces.  We  denote  by  P  and  Q  the  projectors  of  H  onto  M,  N  (resp.).  We  are  interested  in 
the  following  property  of  the  pair  (M,  N)  (or  (P,  Q))\ 

(7)  ll/ill"  <  c{M,  Af)(l|P-^ft|p  +  for  any  heH, 

where  =  I  —  P,Q^  =  I  —  Q  project  onto  the  orthogonal  complements  of  M  and  N 
(resp.).  Clearly,  (7)  is  an  abstract  form  of  (6).  It  can  be  given  several  equivalent  forms: 

(а)  |1PQ||(=  IIQPII)  <  1; 

(б)  sup{|  <m,n>  \  :  m  £  M,n  £  N,  ||m||  ^  1,  ||n||  ^  1}  <  1; 

(8)  {c)  M  nN  =  {0},  and  M  -I-  A"  is  closed; 

(d)  l|P‘^n||  ^  c||n||  for  any  n  e  N  (or,  equivalently,  ||Q''"n||  ^  c\\m\\ 
for  any  m  £  M),  c  >  0. 

If  M,  N  are  of  finite  dimension,  then  all  these  properties  just  mean  M  n  N  =  {0},  but  in 
general  this  last  property  alone  does  not  imply  (8)  which  is  often  expressed  as  “the  positivity 
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of  the  angle  between  M  and  N  ”  (look  at  (b)).  A  proof  of  the  equivalence  of  (7)  and  all 
properties  in  (8)  is,  e.g.,  in  [HJ],  p.  80. 

The  following  general  observation  is  crucial  for  the  Amrein-Berthier  theorem:  If  MnN  = 
{0},  and  PQ  is  compact,  then  (7)  holds.  Indeed,  PQ  being  compact  there  is  a  unit  vector 
veH  such  that  \\PQv\\  =  \\PQ\\;  if  ||PQ||  =  1,  then  1  =  ||PQ^;||  ^  \\Qv\\  ^  ||?;||  z.  i 
whence  ||Qt;||  =  ||t;||  and  Qv  =  v,  so  that  v  G  N  stnd  ||Pt;||  jib'll,  €  M  whereas  the  only 
element  of  M  0  is  zero. 

1.2.3.  Let  us  now  return  to  =  L^(E^)  and  put  P  =  Ps^Q  =  Pe,  M  —  {/  e  : 
esssupp/  C  S'},  =  {/  €  :  ess  spec  /  C  E}.  If  |S|,  |E|  are  finite,  then  PQ  becomes  an 
integral  operator  in  P^  with  the  kernel  H{x,  y)  =  cxs{^)x^{y  -  x)  which  is  Hilbert-Schmidt: 
by  Plancherel 


JJ  \H{x,y)\^dxdy  =  c^  j  x|  • /xl  =  c^|5||S|  < +oo. 

If  f  e  M  ON  then  /  is  an  eigenvector  of  PQ  corresponding  to  the  eigenvalue  1,  so  M  fi  A 
is  finite  dimensional.  Moreover,  its  dimension  can  be  estimated  by  |S|  |E|  : 

(9)  dim(MniV)  ^  jj  \H(x,y)\'^dxdy  = 

Using  this  estimate  it  is  not  hard  to  prove  that  MnN  —  {0}  (i.e.  that  (S,  E)  is  an  a- 
pair),  and  thus  complete  the  proof  of  the  theorem.  Suppose  (p  e  M  n  N^(p  ^  0,  so  that 

50  C  S,  0  <  |So|  where  So  =  esssupp  (p.  For  a  vector  v  put  (pv{x)  —  (p{x  -  v).  For  a 
Vi  €  the  essential  support  Si  of  (p^^  (i-e.  So  4-  Vi)  sticks  out  of  So  (slightly): 

0  <  l^'l  \  Sol  <  £i 

where  ei  >  0  is  arbitrary  (we  are  using  the  finiteness  of  |So|);  functions  p,  (p^^  are  linearly 
independent.  Then  we  find  V2  €  R^  so  as  to  make  S2  =  Si  4-  wi  =  ess  supp  slick  out  of 

51  U  So  (slightly): 

0  <  IS2  \  (Si  U  So)|  <  £2; 

(pi  (Pvi  5  (Pviv2  linearly  independent.  Continuing  this  process  we  arrive  at  an  infinite  linearly 
independent  sequence  of  shifts  of  ip\ 

(10)  (Pi(Pv\•^(PvlV2^  ’  •  ’ 

and  the  sequence  of  sets  So,  So  4-  Vi,  So  4-  ui  4-  f2,  •  -  whose  union  S*  is  of  finite  measure 
if  only  <  4-oo.  The  essential  spectrum  being  shift  invariant  the  sequence  (10)  is  in 
M*  nN,M*  =  Ps*{L‘^)iN  =  Pe(P^)  which  is  impossible  (dim(M*  fl  AT)  ^  c^|S*||E|). 
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1.2.4.  Some  remarks  on  strong  annihilation.  Using  the  equivalence  of  (7)  and  (8)  and  staying 
in  an  abstract  Hilbert  space  H  we  may  solve  a  series  of  problems  pertaining  to  the  UP.  For 
example,  if  (8)  holds,  then  the  operator  v  {Pv^  Qv)  maps  H  onto  M  xN.  This  means  that 
whenever  (5,  S)  is  a  strong  a-pair  the  following  system  of  equations  (with  “the  unknown” 
r  G  L^)  is  solvable: 

r\S  =  p\S,r\E  =  q\J: 

for  any  p,q  e  L^.  Another  example  is  the  following  problem:  describe  the  image  of  the  unit 
ball  of  H  under  the  mapping  /i  ->•  (||F/^|| ,  ||Q/i||)  G  .  This  problem  can  be  solved  quite  ex¬ 
plicitly  for  many  pairs  (P,  Q)  such  that  PQ  is  compact;  the  result  is  a  quantitative  refinement 
of  the  Amrein-Berthier  theorem  (the  Slepian-Pollack  inequality):  roughly  speaking  the  point 
(|lP/i||,  \\Qh\\)  of  the  square  [0, 1]  x  [0, 1]  cannot  get  too  close  to  the  vertex  (1, 1).  For  a  pair 
of  sets  5,  E  of  finite  measure  the  Slepian-Pollack  inequality  answers  the  following  question: 
suppose  h  G  \\h\\  =  1,  and  /  |/ip  =  a  with  a  given  a  G  (0, 1);  how  large  can  f  |/p 

5  E 

be  ?  It  turns  out  that  the  least  upper  bound  of  this  “spectral  energy  carried  by  E”  is  one  if 
a  ^  c(S,  E)  <  1,  but  it  does  depend  on  a  G  (c(5,  E),  1)  remaining  strictly  less  than  one. 

1.2.5.  The  Paneah  Theorem.  The  definition  of  a  strong  a-pair  suggests  the  following  general 
question:  given  a  class  s  of  measurable  sets  5  C  R^^  find  die  class 

s  =  {E  C  R'^  :  (5,  E)is  a  strong  a-pair  for  any  5  G  s}. 

Denoting  by  the  class  of  all  sets  in  R^^  of  finite  Lebesgue  measure  we  may  restate  the 
Amrein-Berthier  theorem: 

(11)  ^ 

It  is  known  that  this  inclusion  is  strict;  I  do  not  know  any  satisfactory  and  complete  description 
of  Sfin.  Let  us  turn  instead  to  an  important  example  of  s  when  s  admits  a  complete  and  explicit 
description:  5  =  =  the  class  of  all  bounded  measurable  sets  in  R^ .  We  say  that  a  Lebesgue 

measurable  set  P  C  R^  is  relatively  dense  (at  infinity)  (or  belongs  to  Srd)  if  there  exists  a  cube 
K  and  a  number  7  >  0  such  that  \{K-\-x)nE\  ^  7  for  any  x  G  R'^ .  A  typical  example 
(for  d  =  1)  is  the  union  of  all  intervals  of  a  given  positive  length  centered  at  equidistant 
points  nh  where  h  >  Ois  fixed  and  n  G  Z.  The  rd-sets  P  C  R  can  be  characterized  by  the 
following  property:  the  observer  moving  along  the  line  y  =  1  sees  E  all  the  time  under  an 
angle  exceeding  a  positive  number  in  other  words 

1  P  di 

(12)  n*(E)  =  -  /  - - ^  ^  CT  for  any  a;  6  R 

TT  J  1  +  [X  —  ty 
E 

Denote  by  5'^  the  set  of  all  complements  of  the  rd-sets.  The  following  theorem  refers  to 
d  =  1  (i.e.  to  R). 

The  Paneah  theorem.  Sb  = 
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(Paneah  proved  C  for  any  dimension;  the  inverse  inclusion  in  any  dimension  was 
proved  by  Logvinenko  and  Sereda  later.)  Here  we  sketch  a  short  proof  of  a  part  of  the  Paneah 
theorem  ( [JH,  Gor,  HJ]),  namely  sj.^  C  h- 

First  note  that  the  Poisson  integral  P{(p)  =  of  G  is  again  in  since 

(13)  (FV)(0  =  e-l«l<^(0 

(where  =  (27r)“^  f  ^(x)  exp(—is^)dx)  whence 

K 

If  spec  (p  C  [0,  /],  then  an  inverse  estimate  can  be  obtained:  |f|  in  (13)  becomes  f  so  that 

1^(01  =  e«|P(V)(0|<6'|P(V){$)|, 

and 


(14)  ll^lh  <  e'llP(y)||2 

by  Plancherel.  Suppose  now  /  G  L^(R),spec/  C  [a,b];  but  then  (14)  is  applicable  to 
(p  =  Q-iaxj  (Yvith  I  =  b  -  a).  Applying  the  integral  form  of  the  two  constants  theorem  to  an 
rd-set  E  (see  (4')  and  (12))  we  get 


whence  ||/||2  <  l/P*  We  have  thus  proved  (8d)  for  P  =  P5,  =  E\  and  Q  = 

P[a,!>]  which  means  that  (5,  [a,  6])  is  a  strong  a-pair. 


1.2.6.  Periodic  case:  strong  annihilation  of  supports  omitting  a  set  of  positive  length  and 
sparse  spectra.  The  problem  setting  of  1.2.1  has  obvious  L^(T) -parallels,  the  corresponding 
definitions  of  a-pairs  and  strong  a-pairs  (5,  E),  5  C  T,  E  C  Z  being  essentially  the  same. 
E.g.,  (S',  E)  is  a  strong  a-pair  if 

l/pdm  +  |/(n)p  I  for  any/  e  L^(T). 

ngE'  / 

This  case  can  be  also  included  into  the  general  scheme  of  1.2.2. 

Let  5t  be  the  class  of  all  sets  S  C  T  satisfying  m[S)  <  1;  denote  sj  by  SPARSE.  A  deep 
and  difficult  result  on  SPARSE  is  due  to  Mikheev  who  proved  that 

A*(2)  C  SPARSE  C  A(2) 

where  A*  (2),  A (2)  are  certain  classes  of  rarefied  sets  of  integers  (A (2)  is  very  familiar  to  the 
specialists;  see  [HJ],  p.  102-110  for  definitions).  Here  we  only  mention  that  it  is  unknown 
whether  A*  (2)  =  A(2).  It  is  known  however  that  the  finite  unions  of  lacunary  sets  are  in 
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A*  (2)  (a  set  A  of  positive  integers  is  called  lacunary  if  sup{m/n  :  m,  n  6  yl,  m  <  n}  <  1; 
a  set  A  of  integers  is  called  lacunary  if  A  H  (0,  H-oo)  and  {|n|  :  n  €  A,  n  <  0}  are  lacunary). 
The  strong  annihilation  of  pairs  {S,  E)  with  m{S)  <  1  and  lacunary  E  (but  not  a  finite  union 
of  lacunary  sets)  was  proved  by  Zygmund. 

1.3.  Review  of  some  ‘'non-complex**  results 

The  main  source  of  concrete  forms  of  the  UP  is,  of  course.  Complex  Analysis.  This  was 
avoided  (or  carefully  masked)  in  the  preceding  parts  of  the  lecture.  Before  turning  to  the 
powerful  complex  machinery  I  just  want  to  mention  some  more  forms  of  the  C/P  susceptible 
to  other  methods. 

1.3.1.  The  Annrein-Berthier  Theorem  revisited.  To  describe  a  new  approach  to  this  the¬ 
orem  we  start  with  a  proof  (due  to  Benedicks)  of  an  -analog  of  annihilation  of  pairs 
(5,E),5,E  C  R,15|  +  |E|  <  +oo:  iff  e  L^{R){=  L^)J\S'  =  0,/|E'  -  0,  then  f  =  0. 
The  proof  is  based  on  two  facts: 

(i)  If  E  C  R,  |E|  <  +00,  then  for  m-almost  all  /i  €  (0,  +oo)  almost  all  points  of  the 
lattice  {kh)kez  (i*e.  all  but  a  finite  number)  avoid  E  ( [HJ],  p.  456) 

(ii)  For  f  £  (R)  put 

pW  = 

fcez 

the  series  converges  in  L^{{- A,  A))  for  any  A  >  0  thus  defining  a  1 -periodic  function 
p  summable  on  (0, 1),  the  periodization  off.  Put  S  =  ess  supp  f,S  =  ess  supp  n(0, 1) 
it  is  easy  to  see  that 

(15)  m[S)  ^  m[S). 

The  e-compression  fe  of  /  is  defined  by  fe[x)  =  /(a:/e);  by  we  denote  the  periodization 
of  /e.  The  k-th  Fourier  coefficient  of  (w.r.  to  the  system  (exp(27rzA;j;))  is  ef(ek). 
Hence,  by  (i),  p^^^  is  a  trigonometric  polynomial  of  period  l/e  for  almost  all  e  provided 
/I  =  0, 1  ^  I  <  oo.  So  for  any  f  £  (R)  and  e  0 

( [HJ],  p.  458).  Now  we  are  ready  to  complete  the  proof:  a  sequence  (p^^''^)x  of  trigono- 

metric  polynomials  (with  — >■  0)  tends  to  /  in  but  if  Sk  is  small,  then  p(^*=^  vanishes  on 
(0, 1)  \  a  set  of  positive  length,  since  |  ^  £fc|5|  (by  (15))  whence  =  0. 

In  the  same  spirit,  but  in  a  much  more  quantitative  way  Nazarov  found  an  explicit  estimate 
of  the  constant  c  in  the  Amrein-Berthier  inequality  (6).  The  abstract  proof  discussed  in  1.2.2- 
1.2.3  did  not  yield  any  information  on  c;  it  was  not  even  clear  whether  c  depends  on  l-S'l,  |E| 
rather  than  on  5,  E. 
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The  Nazarov  Theorem.  The  Amrein-Berthier  inequality  (6)  (for  d  =  1)  holds  with 
c  =  A  expyl|5||E|  where  A  is  an  absolute  constant. 

Writing  (6)  for  the  Gauss  function  f{x)  =  exp(-a:^/2),  5  —  E  :=  [-N,N]  we  find 
c  ^  exp  A'lS'ljEl  for  an  absolute  A'.  Note  that  in  fact  c  tends  to  one  as  |5||E|  ^  0  which  fact 
can  be  easily  deduced  from  the  abstract  geometric  considerations  of  1.2.2  and  the  estimate 
||PsP^|P  ^  con5i|5||S|;  for  |5||E|  bounded  off  zero  the  first  factor  A  in  Nazaror’s  estimate 
can  be  dropped. 

An  elementary  probabilistic  analysis  of  “random  lattices”  in  the  spirit  of  Benedicks  argu¬ 
ment  led  Nazarov  to  a  “finite”  version  of  the  UP  which  is  interesting  in  itself.  In  a  particular 
case  it  was  discovered  by  Turan  in  the  fifties. 

The  Turan  Lemma.  Let  P  be  a  trigonometric  polynomial 

spec  P  being  a  finite  set  of  integers.  Put  ord  P  =  card  spec  P.  There  exists  an  absolute 
constant  C  such  that 

(16)  max  |P|  ^  ((7/m(7))‘'’'‘^^max  |P| 

for  an  arbitrary  arc  7  C  T. 

Note  that  ordP  ^  degP  =  max{|n|  :  n  €  spec  P},  and  (16)  is  an  essentially  non-linear 
result,  since  the  set  of  all  P’s  with  a  given  ordP  is  not  a  linear  space.  Turan’s  original  proof 
was  based  on  some  explicit  interpolation  formulas.  Nazarov  needed  (16)  not  for  arcs  7,  but 
for  arbitrary  compact  subsets  of  T,  and  he  succeeded  in  proving  (16)  for  this  more  general 
situation  which  required  a  new  approach  involving  the  Kolmogorov  weak  type  estimate  of  the 
Hilbert  transform.  He  proved  along  the  way  that  if  spec  P  C  [— M,  M],  then  for  any  t  >  0 

|{CeT:|F'(C)|>iM|P(C)|}KC„i»/i, 

so  that  the  Bernstein  norm  estimate  of  the  derivative  of  a  trigonomtric  polynomial  P  of  degree 
M  (max  |P'|  ^  Mmax  |P|)  holds  pointwise  off  a  set  of  small  measure  (^  Cabs/t)  with  tM 

in  place  of  M. 

1.3.2.  The  F.  and  M.  Riesz  Theorem,  originally  appeared  and  was  perceived  as  a  fact  of 
Complex  Analysis,  but  later  it  was  given  several  proofs  not  using  analytic  functions.  The 
theorem  states  that  a  charge  (=  a  complex  valued  Borel  measure)  on  R  or  T  with  positive 
spectrum  (a  plus-charge)  is  m-absolutely  continuous.  Of  those  non-complex  proofs  I  mention 
here  only  one  due  to  A.B.  Aleksandrov  and  J.  Shapiro  and  based  on  peculiarities  of  the  LP- 
metric  with  p  e  (0, 1)  restricted  to  trigonometrical  polynomials  non-negative 

nJrO 

spectrum.  This  approach  is  applicable  to  the  charges  on  a  multidimensioinal  torus  ( [HJ],  p. 
41-50).  An  interesting  quantitative  version  of  the  F.  and  M.  Riesz  theorem  is  due  to  Pigno, 
Smith  ( [HJ],  p.  23-28). 
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1.3.3.  The  De  Leeuw-Katznelson  Theorem.  The  Fourier  coefficients  (i{n)  of  a  plus-charge 

on  T  tend  to  zero  as  |n|  -\-oo  (an  immediate  corollary  of  the  F.  and  M.  Riesz  theorem). 

This  property  is  stable:  the  De  Leeuw-Katznelson  theorem  states  that  for  any  s  >  0  there  is  a 
(^  >  0  such  that  for  any  plus-charge  //  on  T  with  varp,  <  1 

limsup  |/i(n)|  <  5  limsup  |/i(n)|  <  e. 

n^-oo  n-^+oo 

The  proof  is  quite  “real”  ( [HJ],  p.  29-31). 

1.3.4.  Spectral  decay  of  singular  charges.  An  m-singular  charge  p  (on  T)  is  highly  concen¬ 
trated;  the  UP  suggests  its  Fourier  image  p  should  be  “spread”,  but  to  what  extent?  E.g.,  is  it 
possible  for  p  to  tend  to  zero  (in  which  case  we  call  it  an  r-charge  in  honour  of  Rajchman)? 
The  answer  is  “Yes.”  In  this  connection  I  mention  a  beautiful  result  due  to  Salem  (preceded 
by  a  more  elementary  partial  result  due  to  Bari)  characterizing  the  Cantor  subsets  of  T  whose 
Cantor  measure  is  an  r-measure  (  [HJ],  p.  63,  86).  Another  device  to  produce  singular  r- 
measures  are  infinite  Riesz  products  ( [Z,HJ];  a  very  nice  treatment  of  the  Riesz  products  is 
in  [Pey]). 

An  obvious  spectral  obstacle  for  a  charge  //  on  T  to  be  singular  is  the  inclusion  p  €  P{Z). 

The  Ivashev-Musatov  Theorem  asserts  that  this  fact  is  sharp:  for  any  “nice”  non-negative 

00 

function  $  defined  on  [0, -hoc)  with  ^$^(n)  =  -f-oo  there  exists  a  non-zero  m-singular 

1 

positive  measure  p  on  T  with  compact  support  such  that  \p(n)\  <  $(|n|)  for  any  n  G  Z  (we 
are  not  in  a  position  to  discuss  here  “the  nicety”  of  dropping  the  compactness  of  supp  p 
from  the  statement  above  we  may  just  assume  ^  to  be  decreasing  ( [Kor]).  The  proof  is  based 
on  ingenious  asymptotic  estimates  of  oscillating  integrals  in  the  spirit  of  the  Van-der-Corput 
lemmas. 

1.3.5.  Deep  zero  <Sc  sparse  spectrum.  Suppose  /  €  C(T),e  >  0, /(i)  =  0(exp(“|t  — 

l|-(i+e)))  as  i  1  (“a  deep  zero  at  1”)  and  ^  <  -t-oo  (“a  sparse  spectrum”);  then 

/  =  0.  This  is  a  very  particular  case  of  the  Mandelbrojt  theorem.  It  was  given  a  purely  “real” 
proof  by  Belov  (  [HJ],  p.  80-85).  This  proof  is  interesting  in  itself  providing  some  useful 
quantitative  relations.  But  the  complex  approach  to  the  Mandelbrojt  theorem  results  in  its 
much  stronger  forms  and  seems  to  be  the  only  way  to  prove  its  sharpness. 

2.  Complex  Methods 

2.1.  Introductory  Remarks 

2.1.1.  Partial  sums  of  the  Fourier  series  on  T  are  rational  functions,  and  partial  Fourier 
integrals  on  M  are  entire  functions;  both  live  not  only  on  T  or  M,  but  in  the  whole  ambient 
plane  C.  Leaving  R  and  T  for  C  we  get  a  vast  new  perspective  making  many  manifestations 
of  the  UP  just  some  uniqueness  theorems  of  Complex  Analysis.  A  (very  primitive)  example 
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is  this:  if  the  support  of  a  non-zero  charge  /2  on  R  is  bounded,  then  spec  p,  is  not,  since  ft  is  an 
entire  function.  We  can  of  course,  strengthen  this  trivial  remark  replacing  the  boundedness 
of  supp  by  fast  decay  of  /i  at  infinity  (say,  by  the  convergence  of  J(expc|t|)d|//|(t)  for  a 

IR 

c>  0  entailing  the  analyticity  of  (i  in  the  strip  {\Imz\  <  c}.  The  wealth  of  subtle  uniqueness 
theorems  of  Complex  Analysis  yields  far  more  precise  and  profound  forms  of  the  t/P. 

The  complex  approach  gives  a  new  explanation  of  the  UP  phenomena  for  plus-charges  on 
R  and  T.  Suppose  a  function  /  summable  on  the  time  axis  R  has  no  past,  that  is  /|  (—oo,  0)  ~ 
0.  Then  its  Fourier  integral 

/(O  =  f  f{t)  e^p(-itOdt 

Jr 

makes  sense  not  only  for  real  ^ ,  but  also  for  any  $  G  C_ ,  the  open  lower-half-plane,  and  / 
is  analytic  there  ( the  condition  /  G  L^(R)  is  not  essential,  we  might  speak  of  a  charge,  an 
pP-function,  or  a  distribution  living  in  the  future,  i.e.  supported  by  (0,  +oo)).  Reversing  the 
order  of  time  and  frequency  and  changing  sign  in  the  exponent  we  conclude  that  any  plus- 
function  (distribution)  /  (i.e.  when  spec  /  C  [0,  +oo))  is  in  a  way  extendable  to  the  upper 
half-plane  C+  and  this  is  (heuristically)  a  complete  characterization  of  the  plus-functions:  if 
/  is  extendable  from  R  to  a  function  analytic  in  C+  satisfying  some  growth  conditions,  then 
/  is  a  plus-function  (this  is  not  a  theorem,  but  rather  a  useful  heuristic  principle). 

An  analogous  description  of  the  plus-functions  on  T  is  even  more  obvious.  A  Fourier 
series  lacking  negative  harmonics  becomes  a  power  series  converging  in  the  open 

n^O 

unit  disc  ID,  and  the  interpretation  of  a  plus-function  /  on  T  as  a  boundary  trace  of  its  sum 
looks  very  plausible  (and,  similarly  ^  f{a)z^  becomes  a  Laurent  series  in  C  \  (P  U  T)). 

n<0 

These  remarks  are  meaningful  and  rich  in  consequences  for  arbitrary  functions  (not  just 

for  plus-  or  minus-functions):  a  more  or  less  arbitrary  function  /  on  R  can  be  written  as 

+00 

/  ^  sense)  whence 

-oo 

-foo  0 

(17)  f  =  f+-  /-where/+(t)  =  j  f-(i)  =  -  j  f(0e'‘^d^, 

0  —00 

SO  that  f±  are  plus-  and  minus-functions  extendable  to  the  respective  half-planes.  An  analo¬ 
gous  decomposition  is  valid  for  functions  (and  distributions)  on  T:  if,  say,  /  G  Z/^(T),  then 
putting  f-\.  —  f-  —  ~  f{n)z'^  we  get  /  —  /+-/_,  a  formal  equality  to  be 

n^O  n<0 

duly  interpreted  which  is  quite  possible  in  many  cases. 

Let  us  remember  the  following  form  of  the  UP:  a  continuous  non-zero  plus-function  on 
T  cannot  vanish  on  a  set  of  positive  length  (see  section  1.1  of  Part  1).  This  fact  becomes 
now  a  boundary  uniqueness  theorem  for  functions  analytic  in  D  and  continuous  up  to  T, 
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and  a  conformal  mapping  of  C+  onto  P  immediately  yields  an  analogous  result  for  the  plus- 
functions  on  the  line. 

2.1.2.  The  complex  point  of  view  gives  a  new  insight  into  the  notions  of  support  and  spec¬ 
trum.  The  support  of  a  (say,  summable)  function  /  on  R  can  be  characterized  as  the  set  of 
singularities  of  its  Cauchy  potential 

since  f{t)  =  iim($(f  +  is)  —  —  zs))  a.e.  on  R.  But  spec/  =  supp/,  and  (if  /  is 

summable  and  in  many  other  cases)  spec  /  is  the  set  of  singularities  of  the  function  an¬ 
alytic  in  C  \  R  defined  as  $(C)  =  (27rz)“^  f  (/(^)/($  ~  C))<^?  which  is  readily  seen  to  be 

R 

+00  +00 

-(27r)“^  f  f(t)  exp(~it()dt  for  Im(  <  0  and  (27r)"^  /  f(t)  exp(~it()dt  for  Im(  >  0. 
0  0 

A  similar  description  of  supp  /  for  /  e  (T)  is  obvious:  it  coincides  with  the  set  of  sin¬ 
gularities  of  C  (27rz)“^  1),  or  the  complement  of  the  largest 

open  set  O  C  T  such  that  i  is  analytic  in  P  U  O  U  {|C|  >  1}. 

2.1.3.  Versions  of  the  UP  obtained  by  the  complex  tools  are  very  often  based  on  the  fol¬ 
lowing  fact:  suppose  F  ^  0  is  analytic  in  a  domain  O  C  C;  then  log  |F|  is  subharmonic  in 
O: 

(18)  log|F(a)|^  j  \og\F{a-\-rz)\dm(z) 

T 

provided  the  disc  {\z  -  a\  ^  r}  is  in  O  (this  is  Jensen  inequality  (2)).  The  subharmonicity  is 
akin  to  convexity,  and  (18)  implies  a  certain  rigidity  of  |F|:  the  smallness  of  |F|  on  a  small 
(but  solid)  part  P  of  O  makes  \F\  small  on  O  \  P  as  well.  A  rigorous  statement  of  this  kind  is 
the  two  constants  inequality  (4).  The  subharmonicity  of  log  |P|  for  an  analytic  F  entails  the 
following  extremely  useful  heuristic  principle:  if  a  non-zero  analytic  function  is  not  too  big 
(globally),  then  it  cannot  be  too  small  (even  locally).  If  for  example  supp  /  C  [—a,  a],  f  G 
L^([-cr,cr]),  then  |/(C)|  ^  consi exp(cr|C|),C  ^  C  which  is  a  global  growth  restriction 
imposed  onto  entire  function  /;  an  appropriate  form  of  the  Jensen  inequality  forbids  /  to 
decay  too  fast  along  R  or  to  have  too  many  zeros. 

To  conclude  these  introductory  remarks  let  me  mention  the  very  special  plasticity  of  the 
formulas  of  Complex  Analysis.  So,  for  example,  a  band  limited  function  has  at  least  four 

O' 

faces:  it  is  a  trigonometric  integral  f  f(^)  exp(i^t)d^,  but  also  a  power  series,  or  a  contour 

—  O’ 

integral,  the  Borel  transform  of  the  entire  function  /  (with  a  freedom  to  deform  the  contour), 
or  an  infinite  canonical  product. 
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Now  we  start  our  guided  tour  around  (some)  applications  of  the  complex  tools  to  the  UR 

2.2.  Fast  decay  of  f  and  f  at  infinity 

A  pair  (M,  N)  of  positive  functions  on  (0,  oo)  is  called  sufficient  if 
(19)  |/(<)|  ^  M(lt|)&|/(e)|  <  iV(|^|)(i.?  e  R)  =»  /  =  0. 

A  complete  description  of  sufficient  pairs  )  is  due  to  Morgan  who  proved  in  1934 

that  such  a  pair  is  sufficient  if  1/p  +  1/r  <  1  (not  depending  on  A,  B  >  0)  and  found  the 
conditions  to  be  imposed  on  {A,  B)  to  make  the  pair  sufficient  for  given  p,  r  satisfying  1/p  + 
1/r  =  1.  His  main  tool  was  the  Phragmen-Lindelof  theorem  (a  far  reaching  generalization  of 
the  maximum  modulus  principle).  Using  another  complex  tool  (the  Carleman  formula  for  a 
contour  integral  involving  the  logarithm  of  a  function  analytic  in  C+ )  Dzhrbashyan  obtained 
some  sufficiency  criteria  (replacing  the  pointwise  majorization  (19)  by  integral  estimates); 
he  also  got  a  description  of  some  “unilaterally  sufficient”  pairs  (that  is  those  (M,  N)  for 
which  /  =  0  follows  from  the  inequalities  in  (19)  if  both  (or  one)  of  them  are  fulfilled  only 
on  the  ray  (0,  +00)).  The  following  elegant  result  is  due  to  Beurling:  if  /  G  and 

//rxR  ^  /  =  0  (  [B])-  Th^  results  of  this  section  (with 

their  proofs  and  references)  can  be  found  in  [HJ],  p.  128- 137;  we  also  recommend  Nazarov’s 
article  [Na]  containing  a  new  approach  to  sufficient  pairs. 

2.3.  Deep  zero  &  fast  decay  at  infinity 

Let  H  he  Si  non-negative  function  on  [0,  +cx)).  In  this  section  we  call  it  sufficient  if 

f(t) .  e  =  0(1)  (|f|  ->  00, n  G  Z+)  &  \f(t)\  <  H(\t\)  (teR)^f  =  0. 

If  =  0(1)  {t  ->  +00)  for  any  n  >  0,  then  “the  depth  of  zero”  of  /  at  the  origin  just 

means  /^”^(0)  =  0,n  G  Z+.  This  is  actually  a  classical  quasianalyticity  problem  related  to 
the  moment  problem  and  weighted  polynomial  approximation. 

A  necessary  condition  for  H  to  be  sufficient  is  C{H)  =  -oo  (C{H)  =  log HdU, 

see  Part  1  section  1.1);  this  “quantitative”  condition  is  sufficient  if  H  satisfies  a  “qualitative” 
regularity  condition  (which  cannot  be  dropped),  e.g,,  if  H  is  logarithmically  convex  {H(x)  = 
exp{-x/\og{x  +  1))  or  exp  (-a;/ log  (a:  -I- 1)  loglog(a:  1) . . .)  are  sufficient  (see  [Koo2], 
[HJ]).  This  combination  of  a  quantitative  condition  (19)  and  a  qualitative  one  (the  regularity 
of  the  majorant)  is  very  typical  for  many  forms  of  the  UP. 

2.4.  Deep  zero  <Sc  sparse  spectrum 

Here  we  return  to  the  Mandelbrojt  theorem  (see  1.3.5  of  Part  1)  and  briefly  discuss  its 
proof  due  to  Levin  (this  proof  results  actually  in  a  much  stronger  theorem  which  we  won’t 
state  here,  see  [L],  [HJ]).  The  strategy  of  the  proof  is  this:  suppose  /  G  L^(T)  has  a  deep 

e 

zero  at  1  (say,  /  \f{e^^)\dt  =  as  e  |  0;  p  is  positive);  look  at  the  Laplace  transform 

0 
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+00 

0  of  the  periodic  function  G  M);$(p)  =  /  (p{t)e~P^dt  coincides  in 

0 

{Rep  >  0}  with  the  meromorphic  function  Xlngz  fi'^)(P  ~  of  a  very  tempered  growth 
off  the  union  of  the  discs  {\z  -  in\  <  1/4}  ;  the  deep  zero  of  /  at  1  (that  is  the  deep  zero  of 
(p  at  the  origin)  can  be  translated  as  the  fast  decay  of  +  ir})  |  as  ^  f  +oo  (uniformly  in  p). 
The  Poisson-Jensen  formula 

R 

J{N{t)/t)dt  =  I  log|$(ilC)|rfm(C)-log|$(0)| 

0  T 

where  N{t)  =  n{t)  —  p(t),n{t),p{t)  being,  resp.,  the  numbers  of  zeros  and  poles  in  fD, 
implies  the  estimate 

R 

[  (p{t)/t)dt  )  log  rnnoidmio  +  log  |$(0)|; 

J  \JTr\{Re(:>0}Jrn{ReC<0}/ 

the  first  integral  tends  to  -oo  and  so  does  the  whole  bracket,  since  the  modulus  of  the  second 
integral  grows  too  slowly;  if  $  is  regular  at  the  origin  and  $(0)  ^0  (which  we  may  assume  if 
then  p{t)/t  has  to  be  big  for  arbitrarily  large  values  of  i,  and  an  excessive  sparseness 
of  spec  /  (=  the  set  of  poles  of  is  impossible.  We  can  now  go  back:  given  a  sufficiently 
sparse  set  A  of  integers  we  construct  a  meromorphic  function  ^  =  l/B  where  B  is  a  suitable 
infinite  product  (an  entire  function)  vanishing  exactly  at  the  points  in,  n  €  A,  and  growing 
fast  enough  off  the  discs  {\z  ~  in\  <  a}  for  a  positive  a;  applying  the  Riemann-Mellin 
inversion  formula  for  the  Laplace  transform  to  ^  we  obtain  a  periodic  <p  and  then  f  e  C{T) 
with  spec  /  =  A  and  a  deep  zero  at  1;  this  is  how  the  sharpness  of  the  Mandelbrojt  theorem 
is  proved. 

2.5.  Semibounded  spectra 

The  complex  point  of  view  sheds  a  new  light  upon  the  functions  with  semibounded  spec¬ 
tra.  Here  we  can  only  mention  the  highly  developed  theory  of  the  Hardy  classes  whose 
main  objects  are  -functions  with  non-negative  frequencies. 

Let  /  be  an  -function  (1  ^  p  ^  +oo)  on  X  =  T  or  R  with  a  non-negative  spectrum: 
spec  /  C  [0,  +oo)  ( if  p  >  2,  X  =  R,  then  spec  /  is  the  support  of  the  distribution  /);  then 
we  say  /  6  the  Hardy  space  on  X.  Another  object  related  to  H^{X)  is  a  Banach 

space  H^(p)  of  functions  analytic  in  0(=  D  for  X  =  T,  C+  for  X  =  R)  satisfying  a  certain 
LP-growth  restriction.  It  turns  out  that  any  F  6  H^{0)  has  finite  boundary  values  along  the 
normals  m  -a.e.  on  X  thus  defining  a  function  F*  e  Hp{X), 

F*{t)  =  hm  F{rt)  (f  G  T),  F*{t)  =  \im  F{t  +  ie)  (f  G  R). 
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The  mapping  F  F*  takes  H^{0)  isometrically  onto  H^iX),  so  that  the  1X{X)- 
functions  with  no  negative  frequencies  can  be  identified  with  the  analytic  functions  in  H^{0), 
This  close  connection  makes  it  possible  to  understand  completely  many  forms  of  the  UP  for 
the  plus-functions  (including  continuous  and  smooth  plus-functions,  see  [Du],  [Pr],  [Ho], 
[Ga],  [Kool],  [HJ];  these  forms  are  usually  sharp  in  contrast  with  other  spectral  “smallnesses” 
(e.g.,  for  the  band  limited  functions,  see  section  2.6  below).  As  an  example  we  consider  here 
a  complete  and  quite  satisfactory  description  of  the  moduli  of  the  (A) -functions. 

Theorem.  Let  h  0  be  a  non-zero  function  on  X.  The  following  are  equivalent:  (i) 
h  =  |/|  where  f  e  Hp(X);  (ii)  h  e  L^{X,  m)  and  C{h)  >  oo  (see  section  LI  in  Part  1). 

Thus  the  convergence  of  the  logarithmic  integral  C{h)  is  the  only  smallness  restriction  for 
the  equation  |/|  =  h^f  G  H^{X)  to  b&  solvable.  Its  necessity  follows  immediately  from  the 
Jensen  inequality;  its  sufficiency  can  be  proved  by  an  explicit  construction:  if  /i  €  I/{T,  m) 
and  C{h)  >  -oo,  then  Ext /i :  i->  exp /log/i(C)^rfrri(C)  is  in  /fP(lD))  and  |(Ext/i)'‘|  ~  h 

a.e.  on  T;  Ext  h  is  the  so-called  outer  (or  exterior)  function  corresponding  to  h.  An  analogous 
formula  can  be  written  for  X  =  M  as  well. 

The  conditions  of  the  solvability  of  the  equation  \f\  =  h  with  an  unknown  band  limited 
function  /  (i.e.  with  a  bounded  and  not  just  semibounded  spectrum)  can  hardly  be  expressed 
in  palpable  terms  (see  however  [Dy]  for  some  useful  results  in  this  direction).  In  the  next 
section  in  place  of  the  equation  |/|  =  we  turn  to  non-trivial  band  limited  solutions  of  the 
inequality  |/|  ^  h. 

2.6.  Fast  decay  at  infinity  and  bounded  spectrum 

Let  be  a  non-negative  function  defined  on  E.  We  call  it  a  Beurling-Malliavin  majorant 
(BM-majorant)  if  there  exists  a  non-zero  function  /  with  a  bounded  spectrum  such  that 

(20)  I/I  <  h. 

A  bounded  set  being  semibounded  we  immediately  conclude  that  any  BM-majorant  h  satis¬ 
fies  C{h)  >  -oo.  But  in  contrast  with  section  2.5  this  condition  is  far  from  being  sufficient: 
to  guarantee  the  solvability  of  (20)  with  a  band  limited  /  ^  0  we  have  to  impose  some  reg¬ 
ularity  conditions  on  h  to  moderate  its  oscillations  at  infinity.  The  reason  is  simple:  a  band 
limited  function  is  entire  and  of  finite  degree  (i.e.  f{t)  =  0(expcr|t|),  t  e  C,  |f|  — >  +oo): 
the  Poisson-Jensen  formula  shows  that  the  zeros  of  /  tend  to  run  away  with  a  certain  speed 
from  any  (big)  disc  (so  that  the  number  of  zeros  of  /  in  rlD  is  0{r)  as  r  t  +oo).  And  if, 
say,  h{^)  =  0,n  =  1, 2, . . . ,  or  even  if  h{^/n)  tends  to  zero  fast  enough,  then  (20)  im¬ 
plies  /  =  0;  but  this  behavior  of  h  is  very  well  compatible  with  C(h)  >  -oo  if  only  the 
slopes  of  the  pits  on  the  graph  of  h  over  y/n  are  steep,  i.e.  if  h  oscillates  intensely.  (There 
are,  of  course,  even  much  more  obvious  obstacles  for  an  h  with  C(h)  >  —  oo  to  be  a  BM- 
majorant:  for  example  h{t)  =  exp(-l/yjif)  is  not  a  BM-majorant.  But  we  concentrate  now 
on  BM-majorants  bounded  off  zero  on  any  bounded  interval.) 
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Assume  jC{h)  >  —  oo;  the  characterization  of  the  oscillations  of  h  at  infinity  compatible 
with  h  being  a  BM-majorant  is  a  very  hard  problem.  A  remarkable  breakthrough  is  due  to 
Beurling  and  Malliavin.  Their  work  [BMl]  describing  a  large  class  of  BM-majorants  is  deep 
and  difficult  involving  a  good  deal  of  potential  theory  and  complex  analysis.  Here  I  state  only 
one  corollary:  Suppose  h  is  bounded  and  strictly  positive;  if  C{h)  >  — oo  and  \ogh  satisfies 
a  Lipschitz  condition,  then  h  is  a  BM-majorant. 

Another  (and  even  more  famous)  corollary  is  the  so-called  Beurling-Malliavin  multiplier 
theorem  which  I  won’t  state  here.  Subsequent  proofs,  simplifications,  and  approaches  to 
these  corollaries  are  due  to  Koosis,  Kargaev,  and  Nazarov  see  [Koo2,Koo3];  one  more  proof 
is  in  [DeBr].  But  the  original  result  still  seems  to  remain  the  most  general  ( it  is  also  exposed 
in  [HJ],  p.  306-369), 

Note  that  if  h  is  even  and  decreasing  on  [0,  -l-oo)  (no  oscillations  at  all),  then  C{h)  >  -oo 
is  sufficient  for  /i  to  be  a  BM-majorant.  This  fact  is  relatively  simple  and  known  actually  for 
a  long  time  before  the  Beurling-Malliavin  theorem  had  been  proved  (see  the  references  and  a 
proof  in  [HJ],  p.  276). 

Another  “whale”,  the  second  Beurling-Malliavin  theorem,  can  be  only  named  here.  It  is 
devoted  to  the  following  form  of  the  UP:  bounded  support  &  missing  frequencies  (character¬ 
ization  of  the  discrete  sets  A  C  E  such  that  /  |A  =  0  for  a  function  f  ^  0  concentrated  on 
[-(j,  a],  a  >  0;  see  [BM2,  Koo2, HJ]).  Denoting  by  xa  the  characteristic  function  of  a  set  A 
we  may  rephrase  the  problem:  describe  the  BM-majorants  of  the  form  xr\A‘ 


2.7.  Four  theorems  on  the  unilateral  decay 


This  series  of  theorems  starts  historically  with  the  following  result  due  to  Levinson  and 
Cartwright: 

(I)  Suppose  f  e  (T)  satisfies 

(21)  I  /  (n)  I  ^  ( I  n  I )  for  all  negative  integers  n, 

/i :  [1,  -l-oo)  ->  (0,  -f-oo)  being  a  decreasing  function  (“unilateral  decay  of  /”).  If 


(22) 


E 

n=l 


log  h{n) 


■OO, 


then  /  cannot  vanish  identically  on  a  non-degenerate  arc  unless  /  =  0. 

In  1960  Beurling  proved  the  following  result  which  is  in  fact  much  stronger  than  (I):  for 
a  finite  charge  ponR  and  A  >  0  put  pf^iA)  =  {varp)[[A^  +oo)); 

(II)  if 


+00 

f  log  Pm  (A) 

J 


1 


dA  =  —oo, 


(23) 
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then  /i  cannot  vanish  identically  on  a  set  of  positive  length  unless  yu  =  0. 

This  time  “the  unilateral  decay”  refers  to  “the  object”  pt  and  the  spectral  smallness  means 
vanishing  on  a  large  set.  The  remarkable  feature  of  this  result  is  the  total  absence  of  the 
regularity  conditions  (cf.  section  2.3  and  2.6).  The  strategy  of  the  proof  is  killing  the  Cauchy 
potential  C[}j){Q  =  f(t  —  ^  E)  for  any  p  satisfying  (23)  with  fi  vanishing  on 

R 

a  set  of  positive  length.  The  Levinson-Carwright  theorem  (I)  follows  from  (II)  very  easily. 

(III)  (The  Volberg  Theorem)  Suppose  the  conditions  of  (I)  are  fulfilled  and  h  satisfies  some 
supplementary  regularity  conditions  (not  to  be  stated  here);  then  £(/)  >  -oo  unless  /  =  0. 

The  conclusion  of  this  theorem  is  much  stronger  than  in  (I),  but  it  does  not  imply  (I) 
because  of  those  unnamed  regularity  conditions  (their  sharp  form  is  due  to  J.  Brennan,  see 
[Koo2,HJ]);  the  regularity  of  h  in  (I)  is  its  mere  decrease.  Theorem  (III)  was  conjectured  by 
Dyn’kin  in  1975;  its  proof  is  based  on  Dyn’kin’s  theory  of  pseudoanalytic  continuation  and 
delicate  estimates  of  pseudoanalytic  functions  in  the  so-called  boundary  layers. 

(IV)  (The  Borichev  Theorem)  The  last  result  of  this  series  is  due  to  Borichev  and  looks 

(at  first  glance)  even  stronger  than  (III).  It  is  applicable  not  only  to  functions  on  T,  but 
to  distributions  and  even  to  hyperfunctions.  Any  two-sided  sequence  (an)nGZ  of  complex 
numbers  such  that  limsup  ^  1  generates  two  analytic  functions  g+,g^: 

\n\^oo 

fl+(o  =  y]a„c"(ici  <  i),<?-(c)  =  -  ^o„c"(ici  >  1). 

n^O  n<0 

If  |a„|  —  0{\n\^)  for  a  positive  m,  then  is  the  Fourier  series  of  a  distribution 

nez 

r  on  T  and  suppT  is  the  complement  of  the  largest  open  part  of  T  across  which  g+  is 
analytically  extendable  to  -g^  .  The  Borichev  theorem  asserts  in  particular  that  if 

lim  log  |a„|//i(|n|)  =  -oo,  lim  suplog  |a„|/fi.(n)  <  -foo, 

n-y+oo 

and  h  satisfies  (22)  and  some  regularity  conditions  (again  !)  then  it  is  impossible  for  the  non- 
tangential  limits  of  g+  and  g.  to  coincide  on  a  subset  of  T  of  positive  length  unless  a„  =  0. 
In  fact,  [Bor,  BorV]  contain  much  stronger  results  involving  the  divergence  of  a  logarithmic 
integral  (to  be  defined  properly,  since  J]  UnC”  is  not  a  function  on  T,  and  g±  are  not  bound  to 
possess  boundary  values  on  T). 

For  the  most  popular  classical  majorants 

h{n)  =  exp(-cn/  logn  •  log  log  n  • . . .) 

each  of  these  four  theorems  is  a  step  forward  compared  with  the  preceding  one.  But  in  general 
none  of  them  implies  the  rest  (because  of  the  discrepancies  in  the  regularity  conditions).  Their 
proofs  are  different,and  it  is  unclear  whether  they  can  be  encompassed  by  a  single  statement 
and  proof. 
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2.8.  Spectral  gap  &  sparse  support 

We  say  a  tempered  distribution  T  on  R  has  a  spectral  gap  if  R  \  spec  T  contains  a  non¬ 
degenerate  interval.  The  Beurling  theorem  ((II)  in  section  2.7  above)  forbids  the  fast  unilateral 
decay  of  a  function  (or  a  charge)  with  a  spectral  gap.  But  now  we  are  going  to  discuss  the 
sparseness  of  the  support  of  a  charge  with  a  spectral  gap.  One  more  Beurling  theorem  gives 
an  answer:  Suppose  S  CR  is  a  closed  set  such  that 

f  disiix^S)  , 

K 

Then  any  non-zero  charge  p  supported  by  S  has  no  spectral  gap  ( [B,  Koo2,  HJ]). 

A  proof  due  to  Koosis  is  based  on  the  Pollard  approach  to  weighted  approximation  and 
the  Bernstein  band  limited  function  cos  <7^(2:  --  -  B?  peaking  at  Xq  G  R  and  bounded 

by  one  off  {xq  -  R^Xq  R).  This  result  is  only  an  illustration.  The  problem  to  characterize 
the  support  carrying  a  charge  with  a  spectral  gap  has  impressive  connections  with  poten¬ 
tial  theory, weighted  approximation  by  polynomials  and  entire  functions  of  finite  degree  (see 
the  results  by  Levin,  Aldiiezer  &  Levin,  Kargayev,  Benedicks,  Koosis,De  Branges,  Levin  & 
Logvinenko  &  Sodin  quoted  in  [HJ],  p.375).  Here  we  only  mention  that  the  Beurling  the¬ 
orem  of  this  section  is  sharp  in  the  following  sense:  the  spectral  gap  cannot  be  replaced  in 
its  statement  by  a  set  of  positive  length;  this  was  predicted  by  Koosis  and  proved  by  Kar- 
gaev’s  counterexample  (his  original  construction  was  simplified  by  Kislyakov  and  Nazarov, 
see  [HJ],  p.520). 

We  conclude  this  section  by  the  following  problem  posed  by  Sapogov:  is  there  a  set 
A  C  R  of  finite  length  whose  characteristic  function  xa  has  a  spectral  gap?  The  answer  is 
yes,  it  is  due  to  Kargayev  who  constructed  such  an  A  as  the  union  of  disjoint  intervals 
gravitating  to  n  as  |n|  +00  ;  their  endpoints  can  be  computed  by  the  Newton-Kantorovich 

method  in  which  yields  alot  of  information  on  In  (see  [HJ],  p.376-392  and  the  paper  by 
Kargaev  &  Volberg  quoted  there). 

2.9.  Sparse  support  &  unilateral  decay 

A  compact  set  K  C  T  is  called  spacious  if  it  carries  a  non-zero  charge  p  such  that 
(24)  \fi(n)\  =  0{\n\-^)  [n  -oo) 

for  any  m  >  0.  The  characterization  of  “bilaterally  spacious”  sets  K  (i.e.  carrying  a  charge 
/Li  ^  0  satisfying  (24)  with  |n|  -A-  00)  is  easy:  they  are  just  the  sets  with  interior  points. 
The  unilateral  character  of  (24)  makes  the  description  of  spacious  sets  a  much  more  delicate 
task.  It  is  obvious  that  the  length  of  a  spacious  K  is  positive,  since  any  p  satisfying  (24)  is 
m-absolutely  continuous  by  the  F.  and  M.  Riesz  theorem.  But  this  condition  is  not  sufficient. 
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Denote  by  C{K)  the  set  of  all  components  ofT\K  and  call 

the  entropy  of  K, 

The  HruSCev  Theorem.  K  is  spacious  iff  it  contains  a  compact  subset  of  positive 
length  and  finite  entropy  (see  [Hru,  HJ]j. 

This  is  a  difficult  result.  It  uses  among  other  things  some  variants  of  the  Khinchin- 
Ostrowski  theorem  on  normal  families  of  functions  analytic  in  a  disc,  delicate  estimates  of 
outer  functions  and  a  clever  construction  of  a  special  measure  on  T  (its  simplified  version  due 
to  N.  Makarov  is  in  [HJ]).  The  Hruscev  theorem  stated  above  is  just  a  representative  of  a  long 
series  of  his  results  on  “sparse  supports  &  unilateral  decay”  including  many  concrete  sorts  of 
“decay”  and  various  “objects”  (not  necessarily  charges);  only  a  part  of  the  results  of  [Hru]  is 
in  [HJj. 

3.  Local  and  Antilocal  Convolutions 

This  part  is  devoted  to  a  form  of  the  UP  for  shift  invariant  linear  operators;  we  call  it 
antilocality.  The  most  interesting  examples  and  problems  come  from  potential  theory  and 
are  discussed  in  section  3.3.  A  class  of  antilocal  operators  is  the  theme  of  section  3.2,  in 
section  3,1  we  discuss  local  (“almost  differential”)  operators  as  opposed  to  the  antilocality  of 
sections  3.2  and  3.3.  In  this  part  everything  is  closely  related  to  the  UP  of  Parts  1  and  2. 

3.1.  Local  and  completely  local  convolutions 

We  denote  by  D'(M'^)  the  set  of  all  distributions  in  Let  /C  be  a  linear  operator  mapping 
a  linear  set  domK  C  D'(M^)  into  .  We  call  it  local  if  it  does  not  increase  the  support: 

T  e  domK  =>  supp  KT  c  supp  T, 

or  what  is  the  same  K{T)\0  depends  only  on  T\0  for  any  open  O  C  and  T  G  domK.  A 
typical  example  is  any  linear  differential  operator  with  (7°° -coefficients;  this  is  in  a  sense  the 
only  possible  example:  it  can  be  proved  under  some  mild  conditions  to  be  imposed  on  domK 
(but  not  in  general  !)  that  local  operators  are  differential  (the  Peetre  theorem). 

A  local  operator  K  reproduces  any  open  zero  set  E  of  a  distribution: 

(25)  T  e  domK,  T\E  =  (}^  KT\E  ^  0. 

Suppose  domK  and  imK  =  K{domK)  consist  of  locally  summable  functions  so  that  T\E 
and  KT\E  make  sense  for  T  6  domK  and  any  Lebesque  measurable  set  E  c  Then  we 
call  K  completely  local  if  (25)  holds  for  any  such  E  (not  only  open).  Any  linear  differential 
operator  whose  domain  consists  of  sufficiently  smooth  functions  is  completely  local. 
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Now  we  turn  to  the  shift  invariant  operators  JC  on  M:  Let  K  bQ  a  Lebesque  measurable 
function  on  R,  Consider  the  convolution  operator  K 

(26)  domic  =  {/  e  Kf  e  Kf  =  T^  {J  e  domIC). 

De  Branges  found  a  complete  characterization  of  the  symbols  of  local  operators. 

The  De  Branges  Theorem,  (a)  Suppose  K  is  a  restriction  to  R  of  an  entire  function 
k  of  the  Cartwright  class  and  degree  zero  (that  is  k{()  =  0{exp5|Cl),  |C| 
e  >  0,  and  C(k)  <  +oo  );  then  K  is  local; 

(b)  suppose  domic  contains  a  non-zero  function  vanishing  on  a  non-degenerate  interval; 
if  1C  is  local, then  K  —  k\Rfor  an  entire  function  k  of  the  Cartwright  class  and  degree  zero. 

In  other  words  local  shift  invariant  operators  with  a  sufficiently  rich  domain  are  precisely 
“the  almost  differential  linear  operators  with  constant  coefficients”;  they  can  be  written  for- 

00  oo 

mally  as  f  where  represents  a  slowly  growing  entire  function,  not  too  far 

k=0  k=0 

from  a  polynomial. This  result  is  only  a  particular  case  of  a  much  more  precise  theorem  due  to 
De  Branges  and  describing  the  symbols  of  the  so-called  a-local  convolutions  ( [DeBr,HJ]). 

In  the  extreme  case  of  a  polynomial  symbol  K  our  operator  K  becomes  a  usual  linear 
differential  operator  with  constant  coefficients  defined  on  the  Sobolev  space  TTJ  —  [J  £ 

:  f  ^  (7(^-1)^  absolutely  continuous,  e  —  degK).  In  this  case  1C  is 
not  just  local,  but  completely  local.  De  Branges  posed  the  following  question:  suppose  1C  is 
almost  differential  (as  in  his  theorem  above);  is  it  completely  local  ?  A  counterexample  was 

oo 

constructed  by  Kargayev.  He  actually  showed  that  f  Y  can  be  not  completely 

k=0 

oo 

local  even  for  an  entire  symbol  k{()  =  Y  of  order  zero  (i.e.  k{Q  =  0(exp  |CP)  for 
any  e  >  0),  see  [HJ],p.482-484. 

3.2.  Complete  antilocality 

The  results  of  section  3.1  are  in  fact  closely  related  to  the  themes  of  Part  2,  but  only 
with  respect  to  the  tools.  We  turn  now  to  an  opposite  property  of  certain  convolutions  (the 
antilocality)  which  is  itself  a  form  of  the  UP  for  a  shift  invariant  operator  forbidding  us  to 
know  too  much  on  the  operator.  Our  theme  here  is  the  compulsory  increase  of  the  support  of 
a  non-zero  function  under  certain  convolution  operators. 

A  linear  operator  1C  defined  on  domK  C  V’{^)  with  values  in  X>'(R‘^)  is  called  antilocal 
if 

T  e  domic,  T  7^  0  ^  supp  /C(T)  D  R^  \  supp  T 
which  means  that  the  following  UP  holds:  for  any  non-empty  open  E  and  T  G  domIC 

(27)  T\E  =  1C{T)\E  =  0^T  =  0, 
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forbidding  the  simultaneous  vanishing  of  T  and  /C(T)  on  a  solid  (=open)  set.  This  UP  is 
valid  for  some  interesting  convolution  operators  and  is  akin  to  its  “harmonic”  prototype.  The 
simplest  example  is  the  Hilbert  transform  (in  E)  taking  /  G  to  f,f{x)  =  p.v.  f  f(t) /{t  - 

R 

x)dt;  in  spectral  terms  this  means  /(O  =  C'Sgn^‘f{^).  If  /,  /  both  vanish  on  an  open  C  E, 
then  the  function  (p  :  (  >->  f  -  Qdt  analytic  in  the  domain  U  U  C_  vanishes 

R 

on  E  and  is  thus  identically  zero;  being  the  jump  of  p  as  its  argument  crosses  E,  /  ==  0  a.e. 
on  E.  It  is  not  hard  to  see  that  the  logarithmic  potential  /  /  *  log  |a;|  and  the  M.Riesz 

potentials  /  i-4  f  *  \x\~^  (0  <  a  <  1)  are  antilocal.  But  the  Hilbert  transform  and  the 
logarithmic  potential  enjoy  in  fact  a  much  stronger  uniqueness  property:  they  are  completely 
antilocal.  We  say  that  K  is  completely  antilocal  if  (27)  holds  not  only  for  any  open  E,  but 
for  any  set  E  of  positive  Lebesgue  measure  in  E^  (we  assume  that  domK  and  imK  consist 
of  locally  summable  functions).  It  is  sometimes  quite  hard  to  prove  (or  disprove)  that  an 
antilocal  operator  is  completely  antilocal;  for  the  Hilbert  transform  this  property  coincides 
with  the  UP  for  ii7^(E),  see  section  1.1  of  Part  1. 

The  symbols  of  local  operators  described  by  the  De  Branges  theorem  of  section  3.1  con¬ 
sist  of  a  single  analytic  block  being  polynomials  or  entire  functions.  It  turns  out  that  many 
symbols  consisting  of  two  different  “analytic  blocks”  define  an  antilocal  (or  even  a  completely 
antilocal)  operator. 

Let  be  a  Lebesgue  measurable  function  on  E;  6,  c  G  E,  6  <  c.  Suppose  K  coincides 
on  (c,  +oo)  with  a  rational  function  r,  and  |{^  :  f  ^  ^  7'(^)  iG($)}|  =0.  Then  we  call  K 

a  semirational  symbol  and  r  its  rational  part.  A  typical  example  is  K(^)  =  sgn^  •  r(^)  where 
r  is  rational  ( if  r  =  l,then  we  get  the  Hilbert  transform). 

Define  K  by  (26).  This  operator  is  completely  antilocal  for  many  semirational  symbols, 
e.g.,  for  any  K{^)  of  the  form  sgn^/q{^)  where  g  is  a  polynomial.  But  for  general  semi¬ 
rational  symbols  (27)  is  only  proved  under  a  supplementary  smoothness  conditions  to  be 
imposed  on  T  ;  (27)  can  be  restored  for  all  T  G  domK  provided  E  satisfies  an  extra  “en¬ 
tropy  condition”  as  in  the  HruSdev  theorem  (section  2.9  of  Part  2),  and  it  is  unknown  whether 
these  supplementary  conditions  can  be  dropped  ( [HJ],  p.484-488  and  the  references  therein 
including  papers  by  Havin,  Joericke,  Makarov,and  Ch.Bishop).  For  example  it  is  unknown 
whether  (27)  is  true  for  K{^)  =  sgn^  •  (^  —  i)/{^  —  2i)  (it  is  true  if  we  assume  T  G 

Another  interesting  antilocal  convolution  is  the  M.  Riesz  potential 

whose  symbol  also  consists  of  two  different  analytic  pieces.  As  to  the  complete 

antilocality,  it  is  only  known  that  for  a  G  (0, 1)  the  following  property  holds:  if 

f  IfitWl-'^dt  < -Voo, 

\t\>i 


(28) 


V.P.  Havin  /On  the  Uncertainty  Principle  in  Harmonic  Analysis 


27 


(29)  ECR^  \E\  >  0,  f\E  =  (/  *  \x\-^)\E  =  0, 

and  f  satisfies  a  Holder  condition  (depending  on  a)  near  E,then  /  =  0  (see  [HJ],  p.499-508 
where  all  real  values  of  a  are  also  considered).  Thus  an  extra  smoothness  condition  emerges 
here  once  again  although  the  proof  is  quite  different  from  the  proof  of  the  UP  for  semirational 
symbols.  But  in  this  case  these  conditions  cannot  be  dispensed  with.Using  a  method  due  to 
Bourgain  &  Wolff  a  non-zero  continuous  function  /  on  E  has  been  constructed  in  [BH]  which 
satisfies  (28)  and  (29). 

3.3.  A  uniqueness  problem  for  the  Newton  potentials 

An  extremely  interesting  example  of  the  antilocal  behaviour  can  be  observed  on  the  New¬ 
ton  convolution  in  E^,  i.e.  on  the  operator 


/  ->■  U^,  Uf{x)  =  f  X  6  R" 

J  \x~-y\ 

(or,  more  generally,  on  the  M.Riesz  potentials 


UL{=^)  =  j 


The  antilocality  of  the  Newton  potential  U  can  be  interpreted  as  a  uniqueness  property 
of  the  solutions  of  the  Cauchy  problem  for  the  Laplace  equation  in  the  upper  half-space  E^, 
and  it  is  very  close  to  a  boundary  uniqueness  property  of  harmonic  (^divergence-  and  curl- 
free)  vector  fields  in  E^,  a  three  dimensional  analog  of  the  boundary  uniqueness  theorem  for 
functions  analytic  in  the  upper  half-plane  C+  (=  a  UP  for  plus-functions,  see  section  1.1  of 
Part  1  and  section  2.5  of  Part  2  ). 

A  remarkable  construction  due  to  Bourgain  and  Wolff  [BW]  (preceded  by  a  breakthrough 
in  [W]  and  some  simplifications  due  to  Aleksandrov  and  Kargayev  [AK]  )  has  disproved 
the  complete  antilocality  of  the  Newton  potential  in  E^  :  there  exists  a  continuous  non-zero 
function  /  in  E^  such  that  =  /  =  0  on  a  set  of  positive  area  in  E^ .  It  is,  however,  unknown 
whether  such  an  example  is  possible  with  a  smooth  (say  not  to  mention  )  function 
/.  The  one-dimensional  construction  of  [BH]  related  to  the  M.Riesz  potentials  suggests  that 
the  answer  may  be  negative.  The  antilocality  properties  of  the  Newton  potentials  are  one  of 
the  themes  of  [HJ]  (see  p. 48 8-508,  and  the  references  to  the  papers  of  Mergelyan,  Landis, 
M.M.Lavrentjev,  N.Rao,  and  Maz’ya  &  Havin). 


Acknowledgement 

This  text  was  written  during  my  stay  at  the  Department  of  Mathematical  Sciences  of 
NTNU  (Trondheim)  in  May  2000.1  want  to  express  my  sincere  gratitude  to  the  Department 
for  its  hospitality  and  excellent  working  atmosphere.  I  am  also  indebted  to  Yu.Lyubarskii 


28 


V.P.  Havin  /On  the  Uncertainty  Principle  in  Harmonic  Analysis 


for  useful  discussions  which  have  influenced  some  parts  of  the  text  and  to  E.Malinnikova  for 
detecting  and  removing  some  misprints. 

References 

[AK]  Aleksandrov  A.B.,  Kargaev  P.P.,  Hardy  classes  of  Junctions  harmonic  in  the  upper  halfspace.  Algebra  i 
Analiz,  5,  N2, 1993,  p.1-73  (Russian;  English  translation:  St.Petersburg  Math.J.,  5,  N2, 1994,  p.663-718). 

[B]  Beurling  A.,  Collected  works,  vols  I-II.  Birkhauser,  Boston,  1989. 

[Ba]  Bari  N.K.  A  treatise  on  trigonometric  series,  vols  I-II.  Pergamon-Press,  Oxford,  1964. 

[BH]  Beliaev  D.B.,  Havin  V.P.,  On  the  uncertainty  principle  for  the  M.Riesz  potentials.  Preprint,  Department 
of  Math,  Royal  Institute  of  Technology,  Stockholm,  TRITA-MAT- 1999-24  (Dec  1999),  p.1-18. 

[Bo]  Boas  R.P.,  Entire  Functions,  Academic  Press,  NY,  1954. 

[Bor]  Borichev  A.  A.,  Boundary  uniqueness  theorems  for  almost  analytic  Junctions  and  asymmetric  algebras. 
Mat.Sbomik,  136,  N3, 1988  (Russian). 

[BorV]  Borichev  A.  A.,  Volberg  A.L.,  Uniqueness  theorems  for  almost  analytic  functions.  Algebra  i  Analiz,  1, 
Nl,  1989,  p.  146-177.  (Russian;  English  translation:  Leningrad  Math.J. , 1,  Nl, 1990,  p.  157-192.) 

[BMl]  Beurling  A.,  Malliavin  R,  On  Fourier  transforms  of  measures  with  compact  support.  Acta  Math.,  107, 
1962,  p.  291-302. 

[BM2]  Beurling  A.,  Malliavin  R,  On  the  closure  of  characters  and  the  zeros  of  entire  functions.  Acta  Math., 
118,Nl-2,  1967,  p.  79-93. 

[BW]  Bourgain  J.,  Wolff  T.  A  remark  on  gradients  of  harmonic  functions  in  dimension  3.  Colloq.Math.,  60/61 , 
Nl,  1990,  p.  253-260. 

[Car]  Carleson  L.,  Selected  problems  on  exceptional  sets.  Van  Nostrand,  Princeton,  New  Jersey,  1967. 

[Carl]  Carleman  T.,  Vint^grale  de  Fourier  et  quelques  questions  qui  s’y  rattachent.  AImkvist  und  Wiksels 
boktr.,  Uppsala,  1944. 

[DeBr]  De  Branges  L.  Hilbert  spaces  of  entire  Junctions.  Prentice  Hall,  Englewood  Cliffs  (N.J.),  1968. 

[Du]  Duren  RL.  Theory  oj  H^  spaces.  Academic  Press,  NY,  1970. 

[Dy]  Dyakonov  K.M.,  Moduli  and  arguments  oj  analytic  functions  from  the  subspaces  invariant  for  the  back¬ 
ward  shift  operator.  Sibirsk.  mat.  zhum.,  31,  N6,  1990,  p.  64-79  (Russian). 

[Ga]  Garnett  J.,  Bounded  analytic  functions.  Academic  Press,  NY,  1981. 

[Gor]  Gorin  E.A.,  Some  remarks  in  connection  with  a  problem  of  B.P.Paneah  on  equivalent  norms  in  spaces  of 
analytic  functions.  Teoriya  funktsii,  funkt.  analiz  i  ih  prilozhenia,  44,  1985,  p.  23-32  (Russian). 

[HJ]  Havin  V.,  Joricke  B.,  The  uncertainty  principle  in  harmonic  analysis.  Springer- Verlag,  NY,  1994. 

[Ho]  Hoffman  K.,  Banach  spaces  of  analytic  functions.  Prentice  Hall,  Englewood  Cliffs.  N.J.,  1962. 

[Hru]  HruSCev  S.V.,  Simultaneous  approximation  and  removable  singularities  of  integrals  of  the  Cauchy  type. 
Trudy  MIAN,  130, 1978,  p.  124-195  (Russian). 

[JH]  Joricke  B.,  Havin  V.,  Traces  of  harmonic  functions  and  comparison  of  norms  of  analytic  functions.  Math. 
Nachr.,  98, 1980,  p.  269-302  (Russian). 

[Kah]  Kahane  J.-R,  Siries  de  Fourier  absolument  convergentes.  Springer- Verlag,  NY,  1970. 

[KahS]  Kahane  J.-R,  Salem  R.,  Ensembles  parfaits  et  series  trigonomitriques.  Paris,  Hermann,  1963. 

[Katz]  Katznelson  Y.,  An  introduction  to  harmonic  analysis  (Second  edition).  Dover,  NY,  1976. 

[Kool]  Koosis  R,  Introduction  to  H^  spaces.  Cambridge  University  Press,  Cambridge,  1980. 

[Koo2]  Koosis  R,  The  logaritmic  integral, vols. I-II .  Cambridge  University  Press,  Cambridge,  1988,  1992. 

[Koo3]  Koosis  R,  Legons  sur  le  th/or^me  de  Beurling  et  Malliavin.  Les  publications  CRM,  Montri^al,  1996. 

[Kor]  KornerT.W.,  Uniqueness  for  trigonometric  series.  Ann.  of  Math.,  126,  Nl,  1987,  p.  1-34. 

[L]  Levin  B.Ya.,  Distribution  of  zeros  of  entire  functions.  AMS,  Providence,  1964. 

[Lev]  Levinson  N.,  Gap  and  density  t/icorew^.Amer.Math.Soc.Coll.  Publ.,  NY,  1940. 


V.P.  Havin  /  On  the  Uncertainty  Principle  in  Harmonic  Analysis 


29 


[M]  Mandelbrojt  Sz.,  Series  adherentes,  r^gularisation  des  suites,  applications.  Gauthier- Villars,  Paris,  1952. 
[Na]  Nazarov  F.,  Local  estimates  of  exponential  polynomials  and  their  applications  to  the  uncertainty  principle. 
Algebra  i  analiz,  5,  N4,  1993,  p.  3-66  (Russian;  English  translation:  St.Petersburg  Math.J.,  5,  N4,  1994, 
p.  229-286). 

[Nik]  Nikolskii  N.K.  Treatise  on  the  shift  operator.Springer-Verlag,  NY,  1986. 

[PW]  Paley  R.E.A.C.,  Wiener  N.,Fc>Mner  transform  in  complex  domain.  AMS  Colloq.Publ.XIX,  Providence, 
1934. 

[Pey]  Peyriere  J.,  Etude  de  quelques  proprietes  de  produits  de  Riesz.  Ann.  Inst.  Fourier,  25,  N2,  1975,  p.  127- 
169. 

[Pr]  Privalov  I.I.,  Boundary  properties  of  analytic  functions.  GITTL,  Moscow,  1950  (Russian). 

[W]  Wollf  T.,  Counterexamples  with  harmonic  gradients  in  .  In  Essays  on  Fourier  analysis  in  honor  of  Elias 
M.Stein,  Princeton,  NJ,  1995,  p.  321-384. 

[Z]  Zygmund  A.,  Trigonometric  series.  Vol.  I-II.  Cambridge  Univ.  Press,  Cambridge,  London,  NY,  1959. 


Operator  Theory  and  Harmonic  Analysis 


H.S.  Shapiro 

Department  of  Mathematics 
Royal  Institute  of  Technology 
S-100  44  Stockholm 
Sweden 

shapiro@math . kth . se 

Abstract.  There  are  very  close  ties  between  the  two  subjects  of  the  title.  In  the 
early  days  of  operator  theory,  this  was  mainly  manifested  in  applications  of  harmonic 
(and  complex)  analysis  to  operator  theory.  For  example,  von  Neumann  proved  that  the 
norm  of  p(T'),  where  T  is  any  contraction  on  Hilbert  space  and  p  a  polynomial,  cannot 
exceed  the  maximum  of  \p{z)\  for  2;  on  the  unit  circle,  as  an  application  of  complex 
analysis.  In  like  manner,  Stone’s  theorems  on  groups  of  unitary  operators  were  proved 
with  the  aid  of  Bochner’s  theorem  characterizing  those  functions  on  the  circle  (or  on 
the  reals)  which  are  the  Fourier  transforms  of  positive  measures.  The  spectral  theorem 
itself  was  obtained  by  various  routes  based  on  Bochner’s  theorem,  the  trigonometric 
and/or  algebraic  moment  theorem,  etc. 

What  is  of  more  recent  date  (and  the  main  focus  of  this  exposition)  are  deep  and 
interesting  applications  of  operator  theory  to  classical  analysis.  A  breakthrough  here 
was  Bela  Sz.-Nagy’s  discovery  that  every  linear  operator  on  a  Hilbert  space  has  a  uni¬ 
tary  dilation;  without  here  entering  into  details  this  implies  that  many  properties  of 
general  contractions  can  be  deduced  from  corresponding  properties  of  operators  in  the 
much  nicer  class  of  unitary  ones,  to  be  sure  on  a  vastly  bigger  space  but  nonetheless 
allowing  many  nontrivial  deductions.  For  example,  von  Neumann’s  theorem  men¬ 
tioned  earlier  is  thus  reduced  to  the  corresponding,  and  much  easier,  problem  where 
T  is  unitary.  We  can  here  truly  say  that  operator  theory,  involving  genuinely  infinite¬ 
dimensional  constructions,  starts  to  have  applications  to  concrete  classical  problems. 
A  beautiful  case  in  point  is  D.  Sarason’s  recognition  of  the  fundamental  role  played 
by  commutativity  in  understanding  (and  getting  radically  new  proofs  of)  classical  in¬ 
terpolation  and  moment  theorems,  like  those  usually  associated  with  the  names  of 
Pick-Nevanlinna,  Caratheodory,  and  Schur.  The  abstract  kernel  of  this  approach  was 
further  clarified  and  generalized  as  the  so-called  “commutant  lifting  theorem”  of  Sz.- 
Nagy  and  Foias.  This  in  turn  led  to  new  and  far-reaching  extensions  of  the  classical 
results  e.g.  to  matrix- valued  functions,  which  also  have  important  applications  e.g.  in 
control  theory. 
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1.  Introduction 

Since  the  dawning  of  functional  analysis  in  the  early  years  of  the  twentieth  century,  it  has 
had  much  interaction  with,  and  inspiration  from,  harmonic  analysis.  Formal  trigonometric 
series  were  the  original  impetus  to  various  notions  of  generalized  functions  (distributions, 
hyperfunctions),  absolutely  convergent  trigonometric  series  were  the  original  model  for  what 
was  to  become  the  theory  of  normed  rings  (Banach  algebras),  and  so  on.  Indeed,  it  is  well 
known  that  the  Lebesgue  integral  itself  was  first  conceived  in  connection  with  the  study  of 
trigonometric  series. 

In  this  talk  I  want  to  focus  on  a  part  of  this  interaction,  that  between  harmonic  analysis 
(understood  here  in  a  broad  sense,  so  as  to  encompass  e.g.  Hardy  spaces  of  analytic  func¬ 
tions  on  the  unit  disk)  and  linear  operators  in  Hilbert  space.  That  these  have  close  links  is 
evident  if  one  considers  (we  shall  further  develop  this  point  shortly)  that  the  solution  to  the 
trigonometric  moment  theorem  is  grosso  modo  equivalent  to  the  spectral  theorem  for  unitary 
operators.  The  “common  denominator”  in  this  case  is  positive  definiteness :  characteristic  for 
a  Hilbert  space  H  is  its  inner  product,  a  positive  definite  Hermitian  form  on  H  x  H-,  whereas 
positive  definite  functions  on  groups  (and  semigroups)  are  a  central  notion  in  harmonic  anal¬ 
ysis. 

Generally  speaking,  the  interplay  between  classical  analysis  and  functional  analysis  be¬ 
gins  by  someone  taking  a  “second  look”  at  some  classical  theorem,  and  finding  that  it  contains 
the  seeds  of  something  more  general.  One  postulates  some  abstract  object  having  only  part 
of  the  features  of  the  original  classical  one,  and  tries  to  deduce  analogous  results.  This  pro¬ 
cedure  does  not  always  lead  to  fruitful  generalizations,  but  in  rare  cases  “a  miracle  happens” 
and  the  general  theory  returns  more  than  was  put  into  it  (thus,  for  example,  the  theory  of 
commutative  Banach  algebras,  originally  modelled  on  absolutely  convergent  trigonometric 
series,  turns  out  to  be  the  tool  par  excellence  for  studying  spectral  decomposition  of  normal 
operators  on  Hilbert  space). 

In  most  cases  there  occurs  no  such  miracle,  but  none  the  less  the  perspective  opened  up  by 
abstract  thinking  may  focus  researchers  upon  new  kinds  of  questions  which  in  turn  stimulate 
the  development  of  classical  analysis,  and  channel  thought  into  new  pathways.  Thus,  for 
example,  study  of  the  invariant  subspace  problem  for  general  linear  operators  on  Hilbert 
space  has  not  thus  far  “paid  off”  by  e.g.  clarifying  the  structure  of  non-normal  operators  in 
a  way  that  could  be  compared  with  what  the  spectral  theorem  accomplishes,  and  may  never 
do  so.  But,  there  has  been  much  valuable  “fallout”  from  the  massive  efforts  that  have  been 
expended  on  this  problem.  For  example,  finding  the  invariant  subspaces  for  just  one  concrete 
integral  operator  (the  so-called  Volterra  operator)  has  led  to  a  new  and  beautiful  proof  of 
a  deep  “classical”  theorem  of  Titchmarsh  on  the  support  of  a  convolution  of  two  functions. 
Quest  of  invariant  subspaces  for  subnormal  operators  has  stimulated  profound  researches 
into  approximation  by  rational  functions  in  the  complex  plane,  which  already  has  produced 
results  of  independent  importance. 
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The  present  talk  is  intended  to  present,  for  non-specialists,  a  small  but  hopefully  inter¬ 
esting  body  of  results  illustrating  the  aforementioned  interplay.  I  assume  familiarity  with  the 
notion  of  a  Hilbert  space,  and  shall  adopt  the  following  notations  and  conventions. 

All  Hilbert  spaces  are  complex  and  separable.  In  a  Hilbert  space  if,  {/,  g)  denotes  the 
inner  product  of  elements  /  and  g.  When  several  Hilbert  spaces  are  involved,  a  subscript  as 
in  (/,  g)H  may  be  used  to  specify  in  which  Hilbert  space  the  inner  product  is  intended.  By 
ll/ll  (or  ll/ll /f)  the  norm  of  /  is  denoted.  An  operator  T  between  Hilbert  spaces  H  and  K 
refers  to  a  continuous  linear  map  from  H  into  K ;  when  K  —  H,  say  T  is  an  operator 
on  H.  The  set  of  all  operators  from  if  to  AT  is  denoted  L{H,K),  and  L{H,  H)  is  usually 
abbreviated  L{H). 

The  span  of  a  subset  E  of  H  denotes  the  set  of  finite  linear  combinations  of  elements  of 
E  with  complex  coefficients.  The  closed  span  of  E  is  the  closure  of  this  set. 

Other  notions  like  subspace,  adjoint,  etc,  which  will  be  used  are  completely  standardized, 
and  the  reader  may  refer  to  any  of  the  textbooks  such  as  [AkGl],  [DuSc],  [Ma]  or  [RiSz]. 
Subspaces  will  be  tacitly  assumed  to  be  closed.  More  specialized  notions  and  notations  will 
be  defined  as  needed. 

The  following  basic  geometric  result  will  be  needed  several  times. 

Lemma  1.1.  Let  H,  H’  be  Hilbert  spaces,  and  E  any  subset  of  H.  Suppose  (p  is  any 
injective  map  from  E  into  H',  such  that 

(1-1)  {‘Pf,V>9)H' =  {f,9)H 

for  all  f,  g  in  E.  Then,  there  is  a  continuous  linear  map  V  from  the  closed  span  of  E  onto  the 
closed  span  of(p{E),  which  moreover  is  an  isometry,  such  that  V\e  =  p. 

For  the  proof,  see  e.g.  [AkGl,  p.  77]. 

Outline  of  the  talk.  In  §  2  we  shall  deduce  the  spectral  theorem  for  unitary  operators,  at 
least  in  a  special  form,  from  the  Herglotz-F.  Riesz-Toeplitz  characterization  of  the  Fourier 
coefficients  of  positive  measures  on  the  circle,  and  discuss  “the  basic  a  priori  inequality”  that 
results  therefrom  and  that  underlies  iho  functional  calculus. 

In  §  3  we  discuss  the  theorem  of  Wold-von  Neumann-Kolmogorov  on  the  structure  of 
isometries,  and  the  application  of  this  to  Beurling’s  invariant  subspace  theorem,  and  to  prob¬ 
lems  of  extrapolation  and  prediction  for  stationary  random  processes. 

In  §  4  we  discuss  the  theorems  of  B.  Sz.-Nagy  on  isometric  lifting  and  unitary  dilation  of 
contractive  operators,  with  applications  (von  Neumann  inequality,  mmean  ergodic  theorem). 

In  §  5  we  present  some  further  ramifications  of  dilation  theory,  notably  the  “commutant 
lifting  theorem”  and  discuss  its  application  to  problems  of  interpolation  (of  the  type  of  the 
Pick-Nevanlinna  problem)  and  moment  theorems. 

For  the  most  part,  we  shall  not  give  detailed  proofs,  but  try  to  convey  some  of  the  main 
ideas  and  techniques  needed  in  the  proofs.  We  cannot  give  a  complete  bibliography  (the 
Pick-Nevanlinna  theorem  alone  would  involve  us  with  hundreds,  if  not  several  thousands,  of 
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references)  but  believe  that  the  references  we  give,  together  with  references  in  those  works, 
suffice  to  give  the  interested  reader  a  good  orientation  in  the  literature. 

2.  The  spectral  theorem  and  harmonic  analysis 

The  starting  point  of  our  story  will  be  the  following  theorem,  discovered  independently 
by  G.  Herglotz,  F.  Riesz  and  O.  Toeplitz  ( [He,  Ri,  To]). 

Theorem  2.1.  Given  a  sequence  of  complex  numbers,  a  necessary  and  suffi¬ 

cient  condition  that  there  exists  a  bounded  non-negative  measure  p  on  the  (Borel  sets  of  the) 
unit  circle  T  satisfying 

(2.1)  J  e-‘’'^diJ.{9)  =  Cn  ,  neZ 

is:  for  every  ^  0,  and  all  choices  of  complex  numbers  we  have 

N 

(2.2)  >  0- 

j,k=0 


Remarks. 


(i)  Condition  (2.2)  can  also  be  formulated  so  that  the  principal  finite  sections  of  the  infinite 
matrix 


(2.3) 


Co 

Cl 

C2 

C3 

C-1 

Co 

Cl 

C2 

C_2 

Cl 

Co 

Cl 

are  non-negative  definite  matrices.  The  matrix  (2.3)  is  called  a  Toeplitz  matrix;  this 
designates  a  matrix  whose  entries  are  constant  along  any  diagonal  parallel  to  the  main 
diagonal. 

(ii)  From  (2.2)  it  follows  readily  that  c_n  —  c„  for  all  n  (in  particular,  cq  is  real  (and 
non-negative)).  These  relations  also  follow  at  once  from  (2.1). 

(iii)  This  theorem  is  nowadays  seen  as  a  special  case  of  a  more  general  theorem  of  Bochner 
about  positive  functions  on  locally  compact  Abelian  groups,  see  e.g.  [Ru]. 

(iv)  In  some  variants  one  replaces  (2.1)  by  the  equivalent  statement  that  the  harmonic  func¬ 
tion 

00 

(2.4)  u{r,e)  := 

n=—oo 


in  the  open  unit  disk  P  is  non-negative. 

We  refer  for  the  proof  of  Theorem  2.1  to  [Zy,  p.  138]. 
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One  of  the  well-known  deductions  of  the  spectral  theorem  for  unitary  operators  is  based 
on  Theorem  2.1.  Let  us  trace  the  main  ideas.  Recall  first  that  an  operator  U  in  L{H)  is  unitary 
if  it  is  a  bijection  of  H  that  preserves  inner  products:  {U  f,Ug)  =  (/,  g)  for  all  f^g  in  H  (or, 
what  comes  to  the  same,  U*U  —  UU*  =  /,  the  identity  operator). 


Definition  2.2.  The  unitary  operator  U  has  simple  spectrum  if  there  exists  a  vector 
f  G  H  such  that  the  linear  manifold  spanned  by  {[/”/  :  n  6  Z}  is  dense  in  H. 


Theorem  2.3.  (spectral  theorem  for  unitary  operators  with  simple  spectrum).  IfUe 
C(H)  is  unitary  with  simple  spectrum,  there  is  a  bounded  positive  measure  ponf  such  that 
the  linear  operator  M  on  (T,  dp)  defined  by 

(2.5)  M  Zip  ,  ip  £  L^(T,  dp) 

where  z  =  denotes  a  generic  point  ofT,  is  unitarily  equivalent  to  U. 

If  H  and  K  are  Hilbert  spaces,  operators  A  e  L{H)  and  B  e  L(K)  unitarily  equiva¬ 
lent  if  and  only  if  there  is  a  unitary  map  U  G  L{H,  K)  such  that  the  diagram 

H  ^  H 


(2.6)  f/j  |c/ 

K  — ^  K 

commutes  i.e.  BU  =  UA.  (This  is  also  expressible  as  “C/  intertwines  A  and  5”).  The 
analogous  weaker  relation  when  U  is  merely  assumed  to  have  an  inverse  in  L{K,  H)  is  called 
similarity. 

Remark.  Observe  that  M  has  a  simple  spectrum,  since  trigonometric  polynomials  are  dense 
in  L2(T,^). 

Proof  of  Theorem.  Define 


(2.7)  Cn  =  {U^fJ)  ,  nez 

where  f  e  H  is  such  that  {U^f}^  span  H.  Then,  {c„}  satisfy  (2.2),  since 

N  N 

j,k—0  j,k=0 


N 


j-0 


>0. 


Hence,  by  Theorem  2.1,  there  is  p  in  M‘*'(T)  (the  class  of  bounded  positive  measures  on  T) 
such  that  (2.1)  holds,  and  hence 
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Writing  n  =  j  -  k,  this  says  that  {U^f,  U^f)  equals  the  inner  product  of  with 
in  -L^(T,  dfj,).  By  virtue  of  Lemma  1.1,  there  exists  a  unitary  map  V  from  H  onto  L^(T,  dfi) 
such  that 

VWf  ^  ,  jeZ. 

Then,  if  M_  denotes  multiplication  by  on  L^{T,  dji),  the  diagram 

H  H 

V 

L^{T,dfi)  - ^  L'^{T,dfi) 

M- 

commutes  (it  is  enough  to  check  that  M-Vg  =  VUg  holds  for  a  set  of  g  whose  linear 
combinations  are  dense  in  H,  e.g.  for  g  =  with  n  e  Z,  and  that  is  immediate).  This 
proves  the  theorem  (with  M_  instead  of  M,  but  clearly  and  M  are  unitarily  equivalent). 

Corollary  2.4.  For  U  as  in  Theorem  2.3,  ifp{z)  = 

(2.8)  ||p(C/)||  <  max|p(2)|. 


Proof.  Because  of  the  unitary  equivalence,  it  is  enough  to  show  that  ||p(Af)||  max  |p(e*^)|, 
i.e.  that  \\p(M)g\\  ^  max  |p(e'^)  |  •  ||c?||  for  all  g  e  L‘^{T,  /i).  But  p{M)g  =  p(e^^)g  so 

(2.9)  \\p{M)gf  =  j  |p(e*'')p|5(e‘^)fd//^  (max  |p(e‘*) |^)||5|p 

which  completes  the  proof. 

Now,  what  about  a  unitary  operator  that  does  not  have  simple  spectrum?  Suppose  /i  is 
any  unit  vector  in  H,  and  let  Hi  denote  the  closed  span  of  {[/"/i  :  n  6  Z}.  \f  Hi  —  H  we 
are  in  the  case  already  treated,  so  suppose  the  contrary.  Then,  pick  a  unit  vector  /2  in  ©  Hi 
(the  orthogonal  complement  of  Hi)  and  let  H2  :=  closed  span  oi  {U^f 2  :  n  G  Z}.  After  at 
most  a  countable  number  of  steps  in  this  way,  we  obtain  a  decomposition  of  H  as  the  sum  of 
mutually  orthogonal  subspaces  Hj  such  that: 

(i)  Hj  is  invariant  for  U  and  U*  =  U~'^  (in  other  words,  the  Hj  reduce  U). 

(ii)  The  restriction  of  U  to  Hj  (also  called  the  part  ofU  in  Hj)  is  unitarily  equivalent  to 
the  operator  “multiplication  by  2:”  on  L^(T,  dpj)  for  some  measure  pj  e  M'^(T). 

Therefore,  introducing  a  new  Hilbert  space  K,  the  direct  sum  of  the  L^(T,  dpj)  (whose 
elements  are  thus  vectors  (p  =  (991,  (^25  •  •  • )  where  (pj  e  L'^{T,dpj)),  and  with  ||(^|p 
X]  f  is  easy  to  check  that  V  is  unitarily  equivalent  to  “multiplication  by  2:”  on  K. 

Then  (once  some  boring  questions  about  measurability  etc.  are  disposed  of)  we  can  deduce 
(now,  with  no  hypothesis  of  “simple  spectrum”): 
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Theorem  2.5.  IfU  is  unitary,  then 
(2.10)  l|p(!^)ll  <  max  |p(e")  |  =;  ||p||oo 

for  every  Laurent  polynomial  p{z)  =  Xl-w 

Remarks.  We  could  call  (2.10)  the  fundamental  a  priori  inequality  for  unitary  operators.  It 
is  the  “high  ground”  from  which  one  can  (by  many  laborious  steps,  to  be  sure)  build  up  the 
full  spectral  theorem  for  unitary  operators  (which  we  have  no  need  to  formulate  here;  for  this 
whole  development  the  reader  is  referred  to  [RiSz,  §  109]  or  [AkGl,  Chapter  6]). 

The  “model”  we  have  constructed  for  U,  as  “multiplication  by  2:  on  is  not  a  fully 
satisfactory  one  (even  though  it  suffices  for  the  deduction  of  the  important  Theorem  2.5) 
because  K  is  not  a  uniquely  determined  or  “canonical”  space,  in  general  many  different 
choices  for  the  Hj  are  possible.  This  is  at  bottom  the  problem  called  “spectral  multiplicity” 
and  can  be  dealt  with  by  methods  in  [DuSc,  Chapter  10]  or  [Hal].  One  can  also  bypass  it, 
and  use  Theorem  2.5  as  the  point  of  departure  for  the  full  spectral  theorem  (see  remarks  and 
references  below). 

A  parallel  development  can  be  done  for  self-adjoint  operators:  if  T  —  T*,  F.J.  Murray 
showed  by  an  elementary  argument  that  for  every  polynomial  /  (A)  = 
being  here  restricted  to  real  values) 

(2-11)  ||/(r)Kmax|/(A)UA|<||r||. 

See  [Ma,  p.  100],  where  (2.11)  is  proven,  and  from  it  the  spectral  theorem  deduced,  following 
an  argument  of  Eberlein.  Alternatively,  the  spectral  theorem  for  self-adjoint  operators  can  be 
deduced  from  that  for  unitary  operators  (and  vice  versa)  by  using  “Cayley  transforms”. 

Let  us  also  observe  that  the  spectral  theorem  for  several  commuting  unitary  operators  can 
be  deduced  from  the  multivariable  generalization  of  the  Herglotz-F.  Riesz-Toeplitz  theorem 
(itself  a  special  case  of  Bochner’s  theorem)  by  adapting  the  argument  we  have  given  above 
in  the  case  of  one  operator.  Again,  using  Cayley  transforms  this  can  be  carried  over  to  sev¬ 
eral  commuting  self-adjoint  operators.  (The  case  of  two  commuting  self-adjoint  operators  is 
equivalent  to  the  case  of  one  “normal”  operator,  i.e.  an  operator  commuting  with  its  adjoint). 

We  shall  not  dwell  on  the  matter.  There  are  a  great  many  known  proofs  and  variants 
of  the  spectral  theorem.  One  of  the  most  powerful  methods  known  for  proving  them  uses 
the  theory  of  commutative  Banach  algebras  to  obtain  the  Gelfand-Naimark  theorem,  giving 
spectral  resolutions  for  commuting  families  of  normal  operators  (  [DuSc,  Chapter  9].  The 
“complete  spectral  theorem”  of  von  Neumann  gives  a  canonical  representation  of  a  normal 
operator  as  a  “direct  integral”  independently  of  any  multiplicity  assumptions  regarding  the 
spectrum. 

It  is  characteristic  that,  regardless  of  the  particular  features  of  these  various  approaches, 
they  are  all  deeply  influenced  by  ideas  originating  in  harmonic  analysis.  In  the  opposite 
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direction,  the  spectral  theorem  has  applications  to  harmonic  analysis:  the  reader  is  invited  to 
deduce  the  Herglotz-F.  Riesz-Toeplitz  theorem  from  Theorem  2.3. 

Before  ending  this  section,  a  few  words  are  in  order  about  “functional  calculus”.  For 
any  operator  T,  with  spectrum  o-(T),  there  is  an  obvious  way  to  define  /(T),  where  /  is 
any  rational  function  with  no  poles  on  a{T)  (for  example,  for  f{z)  =  (z  -  a)~^,  f(T)  is 
defined  to  be  (T  —  a7)“\  where  1  denotes  the  identity  operator,  which  is  well  defined  for 
a  G  C  \  cr(T)).  We  thus  get  a  “functional  calculus”,  that  is  a  map  /  (-)■  /(T)  which  is  a 
continuous  algebra-homomorphism  from  the  set  of  rational  functions  with  no  poles  on  cr(T), 
to  L(H).  Moreover,  a  more  general  calculus  (the  “Riesz-Dunford  calculus”)  can  be  built 
up,  based  on  Cauchy  integrals,  the  “domain”  of  which  is  all  functions  f  holomorphic  on  a 
neighborhood  of  cr(T).  One  of  the  great  thematic  problems  of  operator  theory  is  to  enlarge 
the  domain  of  functional  calculus  beyond  these  holomorphic  /.  But,  this  cannot  be  done  so 
as  to  apply  to  all  operators.  For  special  classes  of  operators  T,  however,  one  can  widen  the 
class  of  /  such  that  /(T)  has  meaning.  This  is  closely  related  to  estimates  for  ||/(T)||,  when 
/  is  rational;  we  shall  return  to  this  point  in  §  4  in  connection  with  von  Neumann’s  inequality. 
For  example,  the  estimate  (2.10)  makes  possible  a  useful  definition  of  f{U)  for  any  unitary 
operator  U,  for  any  /  continuous  on  the  unit  circle  and  even  (ultimately)  for  any  bounded 
Borel  function  on  cr{U)\  full  details  of  this  are  in  the  already  cited  textbooks. 


3.  Isometries,  and  the  Wold  decomposition 

Once  we  leave  the  category  of  normal  operators,  we  cannot  expect  unitarily  equivalent 
models  so  simple  and  transparent  as  “multiplication  by  some  bounded  function  on  an  L‘^{p) 
space”.  Nevertheless  there  are  important  non-normal  operators  that  impose  themselves  on  us, 
and  much  of  the  operator  theory  of  recent  years  is  dedicated  to  finding  structural  properties, 
or  models  of  some  kind,  for  various  classes  of  non-normal  operators.  In  this  section  we  dis¬ 
cuss  isometries.  A  fundamental  theorem  attributed  to  von  Neumann,  Wold,  and  Kolmogorov 
(independently)  is 

Theorem  3.1.  (“Wold  decomposition”).  Let  V  be  an  isometry  on  a  Hilbert  space  H. 
Then,  there  is  a  (uniquely  determined)  decomposition  of  H  as  the  direct  sum  of  two  mutually 
orthogonal  subspaces  H  =  Hi  ^  H2  such  that 

(i)  Hi  and  H2  reduce  V. 

(ii)  The  part  ofV  in  Hi  is  unitary. 

(iii)  The  part  ofV  in  H2  is  unitarily  equivalent  to  a  block  shift. 

Before  proceeding,  let  u£  explain  the  terminology.  If  K  is  any  Hilbert  space,  we  may 
construct  from  it  a  new  one  K  whose  elements  are  sequences  from  K,  that  is  vectors 

k:={ko,kuk2,...)  ,  kj  e  K 
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such  that  ||fc  II  ~  :=  ||/^j|p  <  oo.  The  map  S  e  L(K)  taking  k  to  (0,  ki,...)is  called 

the  shift  operator  on  K,  It  is  an  isometry,  but  certainly  (assuming  K  has  dimension  at  least 
one)  it  is  not  surjective,  and  so  not  unitary.  It  is  not  hard  to  show  that,  for  any  two  Hilbert 
spaces  Ki  and  K2,  the  shifts  on  Ki  and  K2  are  unitarily  equivalent  if  and  only  if  dim  Ki 
(the  dimension  of  Ki)  equals  dim  K2.  Thus,  each  shift  has  associated  to  it  a  unique  unitary 
invariant,  a  positive  integer  or  +00  which  is  the  dimension  of  the  space  K  from  which  K  is 
formed.  It  is  called  the  multiplicity  of  the  shift.  Since  some  authors  use  the  term  to  denote 
exclusively  the  shift  of  multiplicity  one,  we  use  the  term  block  shift  to  denote  a  general  shift. 

A  shift  (or  block  shift)  not  only  is  not  unitary,  it  is  as  far  from  unitarity  as  an  isometry  can 
be.  Indeed,  clearly 

lim5”fc=:0  ,  foralUe^. 

n->oo 

We  may  remark,  in  passing,  that  this  relation  is  the  basis  for  an  abstract  (axiomatic)  definition 
of  a  block  shift  (cf.  [GoGoKa]).  The  block  shift  as  defined  above  is  also  called  unilateral;  this 
is  something  of  a  misnomer — the  same  shift  is  called  bilateral  if  the  ambient  Hilbert  space 
of  sequences  is  doubly  infinite  (like  .  .k-i,ko,ki,. .  .))in  which  case  it  is  unitary. 

If  we  examine  the  proof  of  Theorem  3.1  (which  we  shall  not  give  here,  see  [GoGoKa, 
p.  654])  we  note  the  following  features: 

a)  Hi  is  the  intersection  of  the  ranges  of  all  V”,  n  ^  1. 

b)  H2  is  a  block  shift  of  dimension  dim  W,  where  W  is  the  kernel  of  V*.  Moreover,  H2 

is  the  direct  sum  of  the  subspaces  W,  VW,  which  are  mutually  orthogonal. 

(For  this  reason,  W  has  been  christened  “wandering  subspace”  belonging  to  V,  by 
Halmos.  It  is  also  called  the  defect  space  of  V  for  the  obvious  reason  that,  being  the 
orthogonal  complement  of  the  range  of  V,  it  indicates  how  far  V  falls  short  of  being 
unitary.) 

Of  course,  Hi  can  be  {0}  (and  then  is  a  (“pure”)  shift,  or  H2  can  be  {0}  (and  then  V  is 
unitary,  and  W  =  {0}). 

Theorem  3.1  has  some  remarkable  consequences,  and  we  shall  discuss  two  of  these,  plus 
a  remarkable  generalization  obtained  recently  by  S.  Shimorin. 

First  of  all,  following  P.  Halmos  [Ha2]  we  shall  outline  a  deduction  from  Theorem  3.1 
of  Beurling’s  famous  invariant  subspace  theorem.  (A  subspace  M  of  is  invariant  for  the 
operator  T,  if  TM  c  M.)  Before  formulating  it,  let  us  recall  some  definitions  and  notations. 
By  I/{T)  for  p  >  0  we  denote  the  usual  Lebesgue  space  of  measurable  functions  on  the  unit 
circle  T,  endowed  with  the  norm  ||  ||p  where 

J ’  ll/IU  =esssup|/(e'^)|. 

T 

For  p  ^  1,  LP(T)  is  a  Banach  space.  By  iif^(T),  for  p  ^  1  we  designate  the  subspace 
of  LP(T)  consisting  of  those  functions  whose  negatively  indexed  Fourier  coefficients  are  all 
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zero.  As  is  well  known  (  [Du],  [Ga]),  each  function  in  H^{T)  is  the  nontangential  limiting 
value  almost  everywhere  of  a  holomorphic  function  g  in  the  open  unit  disk  ED  such  that 

0 

is  finite.  The  correspondence  of  g  with  its  boundary  values  induces  an  isometric  isomorphism 
between  and  H^(T).  For  all  of  this  see  e.g.  [Du],  to  which  we  also  refer  for  certain 

special  terminology  like  inner  and  outer  functions,  etc. 

The  shift  on  the  space  of  square  summable  sequences 

a  -  {ao,ai,Q2,---} 

of  complete  numbers  is,  in  an  obvious  way  resulting  from  Parseval’s  identity,  unitarily  equiv¬ 
alent  to  the  operator  “multiplication  by  on  and  also  to  “multiplication  by  z”  on 

To  avoid  excessive  pedantry  we  shall  feel  free  to  use  the  term  “shift  operator”  to 
denote  any  of  these,  according  to  the  context  of  the  discussion.  Now,  we  state 

Theorem  3.2.  (A.  Beurling,  1949).  The  invariant  subspaces  of  the  shift  operator  on 
are  precisely  the  sets 

(3.1)  :=  {^f  :  f  e  H^T)} 

where  ip  is  a  unimodular function  in  H°°(T)  (“inner function”). 

Proof.  It  is  clear  that  a  set  of  the  form  (3.1)  is  a  vector  subspace  of  //^(T),  also  it  is  closed 
(being  the  range  of  the  isometric  operator  “multiplication  by  p”)  and  invariant  with  respect  to 
the  shift  (here,  multiplication  by  e^^).  The  deeper  part  of  the  theorem  is  the  converse  direction, 
whose  proof  a  la  Halmos  [Ha2]  we  now  proceed  to  sketch. 

Thus,  let  M  be  a  (proper)  invariant  subspace.  Then,  restricted  to  M,  the  shift  is  an 
isometry  which  we  denote  by  V,  and  to  this  we  apply  Theorem  3.1  with  M  in  the  role  of 
the  Hilbert  space  H  there.  It  is  easy  to  check  that  the  intersection  of  the  ranges  of  V"  for 
n  —  1, 2, . . .  is  {0},  so  V  is  unitarily  equivalent  to  a  block  shift.  It  is  intuitively  plausible  that 
the  multiplicity  of  this  block  shift  is  one,  but  that  is  no  tautology,  and  indeed  verification  of 
this  is  the  heart  of  the  proof.  We  must  show  that  the  wandering  space  W  is  one-dimensional. 
To  this  end,  let  (p  denote  any  vector  in  W,  which  in  our  context  is  identified  as  M  0  e*^M. 
Thus,  p  is  in  M,  and  orthogonal  to  e*^/  to  every  /  in  M. 

We  can  in  particular  choose  /  —  for  A:  =  0, 1, 2, . . .  and  this  gives 

y  |¥>|V"V6I  =  0  ,  n  =  l,2,.... 

T 

By  complex  conjugation  this  holds  also  for  all  negative  integers  n,  hence  \  p{e^^)\‘^  is  constant 
almost  everywhere  on  T. 


H.S.  Shapiro  /  Operator  Theory  and  Harmonic  Analysis 


41 


Thus,  W  is  SL  closed  vector  subspace  of  (T)  with  the  remarkable  property  that  every 
function  in  it  has  (almost  everywhere  on  T)  constant  absolute  value.  The  reader  should  have 
no  trouble  supplying  the  deduction,  from  this,  that  dim  W  is  at  most  one  (hence,  exactly  one, 
since  otherwise  V  would  be  a  unitary  map  of  M  onto  itself,  which  we  have  ruled  out). 

We  conclude  that  every  element  /  of  H^{T)  is  uniquely  representable  in  the  form 

oo 

n=0 

where  Wn  eW  and  ^  <  oo-  But,  Wn  =  Cn^  for  some  fixed  inner  function  (p,  where 

|cnp  <  OO.  Hence 

oo 

/(e'«)  =  ^(e") 

0 

which  completes  the  proof  of  the  theorem. 

One  advantage  of  this  proof  is  that  it  can  be  adapted  to  prove  generalizations  of  The¬ 
orem  3.2,  pertaining  to  Hardy  spaces  of  vector-valued  functions  (due  to  Lax  and  Halmos, 
see  [GoGoKa]  for  details  and  references).  These  generalized  versions  are  important  for  ap¬ 
plications  e.g.  in  systems  and  control  theory,  as  well  as  multivariate  stochastic  processes  and 
seem  very  difficult  to  prove  by  classical  arguments. 

Another,  very  important,  application  of  Theorem  3.1  is  to  wide-sense  stationary  random 
sequences.  Indeed,  this  was  the  source  of  the  interest  in  Theorem  3.1  on  the  part  of  Wold, 
and  of  Kolmogorov.  For  lack  of  space,  we  cannot  develop  this  here  in  much  detail,  but  refer 
for  the  background  to  [IbRo]  and  the  appendix  by  Peller  and  Khruschev  to  [Ni], 

We  consider  a  “random  sequence”  {Xn}nez-  The  Xn  are  elements  of  an  dp)  space 
for  some  measure  space  H  and  probability  measure  p  on  it  (we  omit  the  cr-algebra  of  subsets 
of  from  the  notation).  We  assume  the  Xn  all  have  mean  zero,  and  var^ce  1.  Moreover,  the 
sequence  {Xn}  is  “stationary”  in  the  sense  that  the  covariance  E{XmXn),  with  E  denoting 
mean  value,  depends  only  on  n  -  m. 

For  many  of  the  purposes  of  statistics,  we  can  ignore  the  underlying  “sample  space” 
(Qjdp)  and  study  the  more  abstract  model,  whereby:  {Xn}nez  o,re  unit  vectors  in  some 
Hilbert  space  H,  and  (X„i,  Xn)  depends  only  on  m  —  n.  We  may  also  suppose  that  H  is 
the  closed  span  of  {Xn}  (otherwise,  simply  re-define  H). 

Denoting  (Xm^Xn)  :=  Cm-n^  one  easily  checks  that  {ck}k£z  satisfy  the  hypotheses  of 
Theorem  2.1,  hence  there  is  a  positive  measure  ^  on  T  satisfying  f  e^^^dp{6)  =  for  all 
k  e  Z,  and  hence 

Thus,  by  Lemma  1.1,  there  is  a  unitary  map  U  from  H  onto  I/^(T,  dp)  carrying  Xn  onto 
for  all  n  eZ. 
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This  makes  possible  the  transformation  of  certain  “statistical”  questions  concerning 
into  purely  “analytical”  ones  in  L^(T,  dp).  Consider,  for  example,  the  problem  of  prediction. 
Let  us  denote,  for  any  integer  m,  the  closed  span  of  by  (“the  past  up  to  time 

m”).  Suppose  we  “know”  Xn  for  all  n  <  0,  and  wish  to  “predict”  the  value  of  Xq.  A  very 
widely  used  method  is  to  use  a  “least-squares  estimate”.  That  is,  use  as  the  “estimate”  for  Xq 
the  random  variable  Xq  which  is  defined  as  the  orthogonal  projection  of  Xq  on  P_i,  If  one 
assumes  (as  we  shall)  that  the  above  unitary  map  U  is  “known”  (also  called  “the  spectral  rep¬ 
resentation  of  the  random  sequence”),  then  this  problem  can  be  transformed  to  the  L^(T,  dp) 
context,  where  it  is  a  standard  type  of  approximation  problem  (weighted  approximation 
by  trigonometric  polynomials)  and  can  be  tackled  by  traditional  methods  (Gram-Schmidt  or- 
thogonalization,  etc.).  For  example,  the  variance  of  the  prediction  error  when  the  estimator 
Xq  is  employed,  ||Xo  —  Xo|P  can  in  principle  be  computed  as  the  squared  distance  from  the 
constant  function  1  to  the  closed  span  of  {e*”^}„<o  in  L^(T, //).  Hopefully,  this  conveys  a 
little  of  the  flavor  of  “prediction  theory”. 

Now,  there  is  an  important  unitary  map  V  of  H:  the  map  defined  for  all  nby  VXn  = 
Xn-i  is  easily  seen  to  extend  by  linearity  and  continuity  to  a  unitary  map  of  H  (which  we 
continue  to  denote  by  V).  (In  the  “spectral  model”  this  becomes  multiplication  by  e"*^). 

Now,  Po  is  invariant  for  V,  and  V  |  Pq  is  an  isometry  of  Pq.  Hence,  by  Theorem  3.1,  we 
get  a  Wold  decomposition  of  Pq  reducing  V  |  Pq.  What  are  the  two  components  into  which 
it  resolves? 

Let  us  look  first  at  the  extreme  cases: 

(i)  F  I  Po  is  unitary;  in  this  case,  by  VFq  =  Po,  so  Po  ~  P-i,  and  by  stationarity,  all 
P„,  n  G  Z  are  equal.  This  is  the  hallmark  of  a  purely  detenninistic  process.  By  means 
of  the  spectral  model,  we  know  this  happens  when,  and  only  when,  in  the  Lebesgue 
decomposition  of  the  spectral  measure  p, 

dp  —  dps  -f  wd9 

(where  Ps  is  singular  with  respect  to  dO,  and  w  e  L^(T,  d9))  we  have  Jjlog wd9  = 
-00.  This  is  a  famous  theorem  due  to  G.  Szegb,  M.  Krein  and  A.  Kolmogorov, 

(ii)  V"|Po  is  a  block  shift;  in  this  case,  fln^o  (called  the  “remote  past”)  reduces  to  {0}. 
The  best  least  squares  estimate  of  Xq  on  the  basis  of  “old”  information  P_^,  where 
m  is  a  large  positive  integer,  tends  in  norm  to  0  as  m  oo.  The  variance  of  the 
prediction  error  tends  to  1.  In  other  words,  the  best  least-squares  estimate  of  Xq  on 
the  basis  of  very  old  observations  is  simply  its  mean  value  (here  assumed  to  be  zero). 
Such  a  process  is  called  purely  indeterminate. 

The  general  case  is  an  amalgam  of  (i)  and  (ii),  that  is,  the  Wold  decomposition  in  the 
stochastic  model  says: 

There  is  a  unique  splitting  X„  =  X^  -f  X"  where  {X^}  is  purely  determinate  and  {X"} 
is  purely  indeterminate.  Their  closed  spans  H"  are  orthogonal  complements  in  H. 
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In  the  spectral  model,  the  splitting  can  be  described  explicitly:  it  mirrors  the  Lebesgue 
decomposition  of  dfi,  such  that  when  flogwdO  =  — oo,  wdd  is  grouped  together  with  dps 
and  we  have  a  purely  determinate  process. 

The  prediction  problem  is  only  one  of  a  great  many  problems  about  stationary  sequences, 
involving  questions  of  mixing,  regularity  and  so  forth.  These  translate  into  interesting  and 
sometimes  very  deep,  in  some  cases  still  unsolved  problems  of  harmonic  analysis.  For  further 
information  see  [IbRo],  and  the  Peller-Khruschev  appendix  to  [Ni]. 

In  closing  this  section,  let  us  remark  that  there  has  been  much  study  in  recent  years  of  the 
invariant  subspaces  of  the  operator  “multiplication  by  2:”  on  the  Bergmann  space  of  the  disk 
D,  that  is,  the  space  AL?{p)  of  analytic  functions  on  P  square  integrable  with  respect  to  area 
measure.  Here  the  situation  is  much  more  complicated  than  in  the  corresponding  Ff  ^  scenario 
of  Beurling’s  theorem.  For  one  thing,  “multiplication  by  2:”  is  now  a  contractive  mapping, 
but  not  isometric.  It  turns  out  that  there  are  invariant  subspaces  M  for  which  M  0  zM  is  not 
one-dimensional;  the  dimension  of  this  subspace  (analogous  to  the  “wandering  subspace”  in 
our  discussion  of  Beurling’s  theorem)  may  be  any  positive  integer,  or  even  infinity. 

Nevertheless,  it  has  been  proved  by  Aleman,  Richter  and  Sundberg  [AlRiSu],  that  the 
space  W  M  Q  zM  still  has  the  property  that,  together,  W,  zW,  span  M.  This 

remarkable  and  deep  result  inspired  S.  Shimorin  [Sh]  to  discover  a  new  general  theorem 
implying  a  Wold-type  decomposition  for  certain  classes  of  operators  (including,  of  course, 
all  isometries,  but  also  many  others)  which  in  particular  yields  the  main  result  of  [AlRiSu]. 

4.  Dilation  theory 

Let  us  start  by  reviewing  some  definitions.  If  A  €  L{H)  and  A'  €  where  H,  H' 

are  Hilbert  spaces  with  H  C  A'  is  said  to  lift  A  (or,  be  a  lifting  of  A)  if  the  diagram 


H’ 

>  H' 

(4.1) 

P 

p 

H 

>  H 

A 


commutes,  P  being  the  orthogonal  projector  of  H'  on  H;  that  is, 

(4.2)  AP  =  PA', 

Applying  A  to  both  sides  of  (4.2)  from  the  left,  gives  A^P  =  APA'  =  PA' A'  =  P(A')^,  and 
now  a  simple  inductive  argument  shows 

(4.3)  A"P  =  P(A')^  ,  n  =  1, 2, . . .  . 

If  T  G  L{H,  H'),  the  operator  PT  (in  L(H))  is  called  the  compression  ofTtoH.  It  is 
easy  to  check  that  this  is  equivalent  to,  denoting  PT  by  A, 

(4.4)  {Thi,  /12)  =  ^2)  for  all  hi  and  h2  in  H. 
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If  A'  is  a  lifting  of  A  to  H',  then  from  (4.3) 

(4.5)  {A^huh2)  =  {A^Ph,h2)  -  {P(AThuh2)  =  {{AThuh2) 

for  all  pairs  hi,h2  in  H.  Taking  n  —  1  and  comparing  with  (4.4)  shows:  if  A'  is  a  lifting  of  A, 
then  A  is  the  compression  to  H  of  the  restriction  A'  \  H.  But,  lifting  implies  more,  namely 
that  A^  is,  for  every  positive  integer  n,  the  compression  of  (/l'|i/)"  to  H. 

The  converse  is  not  true,  and  the  last  relation  turns  out  to  be  important  enough  to  be  given 
a  name: 

Definition  4.1.  If  A  e  L{H)  and  A!  G  L{H'),  where  H,  H'  are  Hilbert  spaces  with 
H  C  H',  A'  is  a  dilation  of  A  if  and  only  if  A^  is,  for  every  positive  integer  n,  the  compression 
of  {A'  I  H)^  to  Hy  or  what  is  equivalent 

(4.6)  {{A'yhuh2)  =  {A-h,:h2) 
for  every  pair  hi,  ^2  itt  H  and  every  positive  integer  n. 

Remarks.  Formerly,  the  term  “power  dilation”  was  sometimes  used  to  denote  this  relation. 

If  H  is  an  invariant  subspace  of  A\  and  A  is  the  part  of  A'  in  H,  then  clearly  A'  is  a 
dilation  of  A, 

Thus,  ''A'  is  a  dilation  of  A”  is  implied  by,  but  in  general  strictly  weaker  than  each  of  the 
assertions  “A'  is  an  extension  of  A”  and  “A'  is  a  lifting  of  A”. 

A  landmark  discovery  in  operator  theory,  due  to  B.  Sz.-Nagy,  is 

Theorem  4.2.  Every  contraction  (i.e.  operator  of  norm  at  most  one)  on  a  Hilbert  space 
has  a  dilation  that  is  unitary. 

One  can  formulate  this  theorem  more  precisely:  if  A  G  L{H),  then  there  is  a  unitary 
dilation  [/  of  A  to  some,  in  general  larger  Hilbert  space  H\  with  the  further  property  that  the 
closed  span  of  {U^H}  for  n  in  Z  is  H'.  This  unitary  dilation  is,  in  a  natural  sense  minimal 
in  that,  roughly  speaking,  is  no  larger  than  it  has  to  be  and  moreover,  modulo  a  natural 
concept  of  isomorphisms  of  unitary  dilations,  this  minimal  one  is  unique.  (For  details,  we 
refer  to  [SzFo].) 

To  get  a  feeling  for  this  remarkable  result,  the  reader  is  urged  to  try  to  construct  a  unitary 
dilation  (u.d.)  of  the  operator  A  that  is  identically  zero  on  H.  Any  u.d.  U  must  satisfy 
{U^hi,h2)  =  0  for  all  in  H,  from  which  it  follows  that  the  subspaces  {IP^H}  for 

71  G  Z  are  mutually  orthogonal,  i.e.  H  is  a  “doubly  wandering”  subspace  for  U  in  the  larger 
space  JT'  where  U  operates,  so  H'  must  be  quite  large! 

Thanks  to  Theorem  4.2  some  results  known  for  unitary  operators  can  be  carried  over  to 
contractions.  Thus, 

Corollary  4.3.  (von  Neumann’s  inequality)  Let  A  be  a  contraction  on  the  Hilbert 
space  H,  and  p  any  polynomial,  p(A)  —  ^hh  complex  coefficients.  Then 

(4.7)  lb(^)ll  ^  max  \p{X)\  A  G  C,  |A|  ^  1. 
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Proof.  Let  U  €  L{H')  be  any  unitary  dilation  of  A.  Then,  for  every  hi,  h2  in  H  we  have 

\(p{U)huh,)\  =  \(piA)huh,)\. 

Taking  the  supremum  of  the  right  side  over  all  unit  vectors  /12  in  H  gives 
(4.8)  ||p(A)/iiH||p([/)ft,|K||p(t7)||||fti|| 

(the  second  norm  being  in  the  space  H'), 

But,  l|p(C/)  II  does  not  exceed  the  maximum  of  |p(A)  |  for  A  on  the  unit  circle,  as  shown  in 
§  2.  Using  this  in  (4.8)  and  maximizing  over  unit  vectors  hi  in  H  gives  (4.7),  completing  the 
proof. 

Remarks.  On  the  basis  of  (4.7)  one  can  enlarge  the  functional  calculus  for  contractions  A,  to 
encompass  all  operators  f{A)  with  /  continuous  on  D  and  in  This  is,  in  turn,  close 

to  von  Neumann’s  notion  of  “spectral  sets”,  see  [RiSz],  [SzFo]. 

One  can  give  proofs  of  (4.7)  using  only  complex  analysis  and  elementary  Hilbert  space 
theoiy,  see  [RiSz].  Here  we  have  shown  its  derivation  from  Theorem  4.2  to  illustrate  a  typical 
application  of  that  theorem.  Another  very  nice  application  is  the  mean  ergodic  theorem, 
see  [RiSz]. 

We  shall  not  prove  Theorem  4.2,  referring  to  [SzFo]  or  [GoGoKa],  but  will  give  a  few 
indications.  First  of  all,  Theorem  4.2  follows  from 

Theorem  4.4.  Every  contractive  operator  on  a  Hilbert  space  has  an  isometric  lifting. 

To  see  why  this  implies  Theorem  4.2  we  require 

Lemma  4.5.  Every  isometric  operator  on  a  Hilbert  space  has  a  unitary  extension. 

Assuming  this  for  the  moment,  suppose  A  is  a  contraction  on  H  and  V  is  some  isometric 
lifting  to  H\  Let  U  be  a  unitary  extension  of  V  to  H".  Then,  in  view  of  earlier  remarks,  U  is 
a  dilation  of  V,  and  U  is  a  dilation  of  A.  It  follows  readily  that  C/  is  a  dilation  of  A. 

We’ll  give  an  informal  proof  of  Lemma  4.5.  Let  V  be  an  isometry  on  a  Hilbert  space  H. 
By  virtue  of  the  Wold  decomposition  there  is  a  splitting  of  H  =  Hi  ®  H2  reducing  V,  such 
that  V  \  Hi  is  unitary  and  V  \  H2  is  unitarily  equivalent  to  a  “unilateral  block  shift”,  that  is 
to  the  operator 

S  :w  :=  {wo,  Wi,W2,...)^  (0,  wq,  Wi,...) 

on  the  Hilbert  space  of  one-sided  sequences  of  elements  of  some  Hilbert  space  W,  with  norm 
of  w  equal  to  (Xl^  ^  taken  as  the  wandering  space  H  ©  VH.) 

Now,  S  has  an  obvious  unitary  extension  S,  namely  the  map  of  the  Hilbert  space  of  two-sided 
sequences  w  .W-i,Wq,'Wi,  .. .)  given  by  the  formula 

{Sw)^  =  Wn~i  ,  n  G  Z, 

the  “bilateral  shift”.  Indeed,  S  is  unitary  and,  restricted  to  the  subspace  for  which  all  Wi 
with  z  <  0  vanish  (which  in  an  obvious  way  is  identified  with  the  above  space  of  one-sided 
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sequences),  S  coincides  with  S.  Thus,  finally,  the  operator  equal  to  V  on  Hi  and  S  on  the 
appropriate  space  is  a  unitary  extension  of,  strictly  speaking,  an  operator  unitarily  equivalent 
to  V.  And,  it  is  easily  checked  that  if  an  operator  has  a  unitary  extension,  then  so  does  every 
operator  unitarily  equivalent  to  it  (this  is  because  a  unitary  operator  always  has  a  unitary 
extension  to  every  larger  Hilbert  space). 

Theorem  4.4  also  allows  a  strengthened  version,  whereby  the  isometric  lifting  is  to  a 
“minimal”  space,  and  this  “minimal  isometric  lifting”  is  essentially  unique,  for  details  we 
refer  again  to  [SzFo]  and  [GoGoKa]. 

As  to  the  proof  of  Theorem  4.4,  we  won’t  give  it  in  full  but  illustrate  one  of  the  main 
ideas  by  sketching  the  proof  of  an  earlier  (weaker)  version  due  to  Halmos,  ^  whose  pioneering 
writings  have  been  instrumental  for  modem  developments  in  operator  theory: 

Theorem  4.6.  Every  contractive  operator  on  a  Hilbert  space  is  the  compression  of  a 
unitary  operator 


Proof.  Let  A  e  L(H),  ||A||  ^  1.  Then,  I  -  A* A  and  I  -  AA*  are  self-adjoint  operators 
with  spectrum  contained  in  the  set  R+  of  non-negative  real  numbers.  By  the  basic  functional 
calculus  for  self-adjoint  operators,  they  have  “positive  square  roots”,  that  is,  there  exist  self- 
adjoint  operators  Da  and  Da*  with  spectmm  in  R+  such  that 

(4.9)  D\  =  I -A*  A 

{Da*)^  =  I-AA\ 

One  calls  Da  the  defect  operator  associated  to  A.  Observe  that  it  is  O  if  and  only  if  A  is 
an  isometry,  and  both  Da  and  Da*  are  O  if  and  only  if  A  is  unitary.  Now,  we  have 

A{A*Af  =  {AA*yA 

and  so,  for  every  polynomial  p{t)  =  Cq  +  Ci^  +  •  •  *  +  with  real  coefficients  Ap{A*A)  = 
p{AA*)A.  Here  we  can  let  p  run  through  a  sequence  {pj}  of  polynomials  converging  uni¬ 
formly  to  on  [0, 1].  Then,  by  functional  calculus,  \\pj{A*A)  -  Da\\  and  \\pj{AA*)  -  Da*  || 
tend  to  zero,  and  we  obtain  the  fundamental  intertwining  identity 


(4.10) 


AD  A  =  Da*  A. 


It  is  now  easy  to  prove  Theorem  4.6.  Let  A  be  a  contraction  on  H.  Then,  we  can  define 
an  operator  on  if  0  Ff  by  means  of  the  “block  matrix” 


(4.11) 


u  — 


A  Da* 
-Da  a* 


^  A  weaker  version  of  Theorem  4.6  with  “univary”  replaced  by  “isometric”  was  discovered  earlier  by  G. 
Julia,  see  [SzFo],  p.  51,  for  the  references.  I  am  indebted  to  N.K.  Nikolski  for  pointing  this  out  to  me. 
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with  the  convention  that  U  maps  an  element  [h]  of  H  ^  H  to 

Ahi  +  Dj\*h2 
—D^hi  +  A*h2 

Clearly  U  £  L{H  ^  H)  and  let  us  now  verify  that  it  fulfills  the  requirements  of  the  theorem. 

First  of  all,  its  compression  to  H  (here  identified  as  the  subset  of  H  ^  H  consisting  of 
elements  with  hi  e  H)  is  A.  Indeed,  the  first  component  of  C/  is  Ahi.  As  for 
unitarity,  Lf*  is  represented  by  the  block  matrix 


A*  -Da 
Da*  a 


so,  at  least  formally,  the  rules  of  matrix  multiplication  give 

_  A  A D\  A* Da*  —  DaA* 

[Da*A-ADa  D%A-AA*  • 

The  diagonal  elements  are  I  by  virtue  of  (4.9),  (4.10)  while  the  element  in  row  2,  column 
1  vanishes  by  (4.11).  The  remaining  off-diagonal  element  is  the  adjoint  of  this  one,  hence 
O.  This  completes  the  proof,  for  people  who  believe  in  block  matrices.  One  can  of  course 
paraphrase  all  these  calculations  by  introducing  the  orthogonal  projector  from  H  ®  H  to 
H  ©  {0}  and  avoiding  block  matrices,  and  it  is  perhaps  an  instructive  exercise  for  the  reader 
to  carry  out  the  proof  in  this  way  too.  But,  block  matrices  are  a  very  convenient  notational 
device. 

This  completes  the  proof  of  Theorem  4.6.  Actually,  we  have  proved  a  bit  more:  the 
unitary  operator  lives  on  Moreover,  if  A  is  a  contractive  operator  from  H  to  K  where 

K  is  any  Hilbert  space  (so  that  A*  €  L{K,  H))  then  A* A  C  L{H),  AA*  e  L{K)  and  we  can 
define  defect  operators  Da  C  L{H)  and  Da*  G  L[K)  as  before.  It  is  then  easy  to  verify  that 
mutatis  mutandis  the  construction  of  a  unitary  operator  as  above  goes  through.  For  details 
see  [GoGoKa]  or  [FoFr].  This  is  a  highly  nontrivial  result  even  for  finite  matrices.  It  implies 
given  any  m  x  n  matrix  which  is  contractive  from  to  C”,  we  can  embed  it  as  the  upper 
left  comer  of  a  unitary  matrix  of  size  (m  +  n)  x  (m  +  n)  (see  (4.12)). 

The  reader  who  has  followed  thus  far  should  now  have  no  trouble  following  the  standard 
proofs  of  Theorems  4.4  and  4.2. 


5.  The  further  development  of  dilation  theory — especially  applications  to  interpolation 
problems 

Since  the  discovery  of  the  results  reported  on  in  the  preceding  section,  they  have  con¬ 
tinued  to  play  a  major  role  in  the  development  of  operator  theory,  especially  to  so-called 
“functional  models”  for  contractions.  This  is  beyond  the  scope  of  the  present  talk,  see  [Pe] 
(Douglas’  article)  and  [SzFo].  But  we  do  wish,  in  closing,  to  say  something  about  one  re¬ 
markable  development  of  dilation  theory  with  spectacular  applications. 
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We  shall  introduce  this  by  going  back  to  the  landmark  paper  [Sal]  which  triggered  this 
development.  Sarason  cast  a  new  light  on  classical  moment  and  interpolation  problems.  We 
can  illustrate  his  approach  with  the  example  of  the  classical  interpolation  problem  of  Pick 
and  Nevanlinna  (henceforth  abbreviated  “PN  problem”).  In  its  simplest  and  purest  form,  this 
is: 

(PN)  Given  are  distinct  points  2:1 , . . . ,  in  the  open  disk  P,  and  complex  numbers 
,  Wn^  Find  necessary  and  sufficient  conditions  that  there  exist  a  holomorphic  func¬ 
tion  f  bounded  in  modulus  by  1  in  P  (we  denote  this  class  henceforth  by  B)  satisfying 

(5.1)  j  =  l,2,...,n. 

There  are  many  further  questions  that  naturally  arise  from  this  one:  If  such  /  exists,  is  it 
unique?  What  can  be  said  about  functions  in  jF/°°(P)  of  least  norm  satisfying  (5.1)7  If  there 
is  more  than  one  f  in  B  satisfying  (5.1),  can  one  describe  the  totality  of  these  functions? 
And,  how  about  the  analogous  problem  with  infinitely  many  points  {zj}l  What  happens  if 
some  Zj  are  permitted  to  coincide,  say  Zi  —  z^  and  one  prescribes  the  functional  f\zf)  in 
(5.1)  to  replace  the  redundant  fizffl  And  so  on.  All  of  these  variants  have  been  studied, 
as  well  as  many  others  (especially,  the  case  where  i/'^(P)  is  replaced  by  an  analogous  class 
of  vector  valued  functions,  which  is  important  in  applications).  We  cannot  here  enter  into 
all  these,  but  will  give  references  at  the  end  of  this  section.  The  remarkable  thing  is  that  the 
path  trodden  by  Sarason  has  turned  out  to  be  fruitful  in  the  study  of  all  these  variants,  and 
moreover  is  virtually  the  only  approach  known  to  yield  results  for  some  of  the  vector- valued 
generalizations  of  (PN). 

The  original  question  was  answered  by  the  following  theorem  of  Pick: 


Theorem  5.1.  Under  the  hypotheses  of(PN)  a  necessary  and  sufficient  condition  for  the 
existence  of  f  in  B  satisfying  (5.1)  is  that  the  matrix 


(5.2) 


1  —  WjWk 
1  -  ZjZk 


n 


be  non-negative  definite. 


Remarks.  To  get  some  feeling  for  the  problem  let  us  look  first  at  simple  cases.  For  n  =  1 
the  problem  is  trivial:  the  desired  /  exists  if  and  only  if  \wi\  <  1.  Moreover,  if  \wi  \  —  1,  the 
solution  is  unique  (the  constant  function  equal  to  tiji),  whereas  if  |ii;i  |  <  1,  there  are  infinitely 
many  solutions,  the  totality  of  which  is  easily  described  (we  shall  return  to  this  point  in  a 
moment).  (The  reader  is  invited  to  examine  carefully  the  case  n  =  2,  and  compare  with  the 
Schwarz  Lemma.) 

It  is  remarkable  that,  by  a  recursive  algorithm  first  proposed  by  I.  Schur  the  general  prob¬ 
lem  (PN)  can  be  reduced  to  the  trivial  case  n  =  1.  Indeed,  whenever  /  €  B,  a  G  D  and 
f{a)  =  b,  either 
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(i)  |6|  =  1  and  /  =  6 

(ii)  |6|  <  1,  and  in  this  case 


:= 


/(^)  -  b 
1  -  bf{z) 


is  in  and  vanishes  at  2:  =  a,  so  that 
h{z)  :=  g(z)  ( 

\  z  ~  a  J  1  —  bf{z)  z  —  a 

is  again  a  function  in  B.  Thus,  if  (PN)  is  solvable  and  \wn\  <  1,  then,  taking  a  =  Zj 
b  =  Wn  in  (5.4)  we  see  that 

f{z)  ~Wn  1  -  ZnZ  , 


1  -  Wnf{z)  Z-Zn 


is  again  in  B.  Moreover 


F{z,)  = 


1  -  ZnZj 


1  -  WnWj 


We  also  see  from  (5.5)  that 


F{Zn)  =  f{Zn) 


=:  F{z) 


{j  =  1). 


1  -  knP' 


Thus,  if  ,  Wn  are  admissible  data  for  f{zi), . . . ,  f{zn)  with  /  G  B  so  are  the  n 

numbers  appearing  on  the  right  hand  sides  of  (5.6)  and  (5.7).  Now,  suppose  we  are  able  to 
solve  the  (n  -  l)-point  version  of  (PN)  for  F  G  B  satisfying  (5.6).  Then,  from  (5.5)  solved 
for  /  we  obtain  a  function  in  B,  Indeed,  solving  (5.3)  for  /  gives 

(5.8)  f{z)  = 

1  +  bg(z) 

which  is  in  B  if  5  is,  so  the  solution  to  (5.5)  is 

(5-9)  f(z)  = 

1  +  WnG{z) 

where 


G(z)  := 


is  in  B. 


Recapitulating;  If  (5.6)  is  solvable  with  F  e  B,  then  /  defined  by  (5.9)  and  (5.10)  is  in 
B,  and  it  is  easy  to  check  from  these  formulae  that  /  satisfies  all  the  n  conditions  (5.1). 
Therefore:  If  <  1,  the  (PN)  problem  reduces  to  one  with  n  —  1  points;  whereas  if 
|u;„|  =  1  we  see  at  a  glance  that  the  problem  is  solvable  if  and  only  if  all  remaining  wj  are 
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equal  to  Wn\  but  of  course  if  >  1  there  is  no  solution.  Let  us  call  the  last  two  cases  trivial 
cases.  We  have  thus  arrived  at  Schur's  algorithm: 

If  \wn\^  1  we  are  done,  trivially.  If  <  L  we  construct  the  new  numbers 


Wj  -Wn  1  -  ZnZj 
1  WjiWj  Zj  '  Zji 


If  \W„.i\  ^  1  we  are  done,  trivially;  if  |VLn-i|  <  1  we  reduce  to  a  problem  forn  —  2 
points  by  the  analogous  formulae  applied  ioWi,. Wn-u  and  so  on. 

We  therefore  compute  recursively  a  sequence  of  complex  numbers  M4,  . . .  and 

(PN)  is  solvable  if  and  only  if  either  all  of  these  “Schur  parameters”  remain  in  D,  or  one  of 
them  is  on  the  unit  circle  and  the  remaining  data  at  that  point  are  equal  to  this  one. 

Volumes  have  been  written  about  the  Schur  algorithm  and  various  generalizations,  an 
excellent  source  is  [FoFr]  and  we’ll  give  other  references.  Nevertheless,  although  this  ade¬ 
quately  answers  (PN)  for  small  n,  it  does  not,  at  least  in  an  obvious  way,  yield  the  elegant 
theorem  5.1,  to  whose  proof  we  now  return. 


Following  in  Sarason’s  footsteps:  if  we  are  to  find  an  operator-theoretic  proof  of  Theorem 
5.1,  which  involves  an  unknown  function  in  we  must  start  by  “encoding”  such  functions 
as  linear  operators  on  a  suitable  Hilbert  space.  Let  us  try  —  i?^(T)  as  our  Hilbert  space. 
Then,  any  ip  G  H^{T)  induces  a  linear  operator  on  by  associating  with  it  the  multipli¬ 
cation  operator  :  g  pg  on  It  is  easy  to  show  the  operator  norm  1|M^||  equals 
||(^||oo-  Moreover,  these  multiplication  operators  are  distinguished  within  L{H‘^)  by  a  simple 
property:  denoting  by  Mz  the  “shift”  on  (i.e.  multiplication  by  the  independent  variable) 
we  have 


Lemma  5.2.  (Sarason).)  If  A  e  and  A  commutes  with  Mz,  then  A  —  for 

some  p  G 

The  converse  is  obvious.  The  reader  should  be  able  to  prove  Lemma  5.2,  starting  with  the 
simple  observations  that  A  commutes  with  Mp  for  every  polynomial  p:  A{pf)  =  p{Af)  for 
all  /  G  hence  Ap  =  pp  for  every  polynomial  p,  where  p  A\.  The  main  point  is  to 
deduce  that  this  p,  so  far  only  known  to  be  in  is  in 

Now  that  we  have  a  nice  way  to  encode  as  operators  on  we  need  a  suitable  way 
to  encode  the  data  (5.1).  Sarason  had  the  inspired  idea  to  use  for  this  purpose  the  Toeplitz 
operator  with  symbol  /.  For  any  p  G  L°°(T),  “multiplication  by  cp”  is  a  bounded  operator  on 
L^(T).  Its  compression  to  the  subspace  H^{T)  is  called  the  Toeplitz  operator  with  symbol  p, 
and  usually  denoted  T^.  Thus 

(5.11)  T^g  =  PM^g  =  P(<pg)  ,  <p  €  L^{J),g  €  H^(T) 

where  P  denotes  orthogonal  projection  of  on  It  is  easy  to  check  the  properties 

(5.12)  lir^ll  ^  lly^lloo  (with  equality  if  ip  €  H°°). 
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(5.13)  t;  =  t^. 

See,  e.g.  [Do],  [Ni]  for  extensive  discussion  of  this  important  class  of  operators.  Now,  another 
important  property  of  Toeplitz  operators  with  conjugate-analytic  symbol  is  this:  let,  for  (  6 
O, 

(5.14)  kc{z)  =  1  . 

1  - 

This  is  the  “representing  element”  for  the  functional  /•-)■/(()  on  j  ^ 

(5.15)  {f,k)  =  m  , 

Crucial  for  us  is  the  identity 

(5.16)  T^kc=W)k  1 

The  proof  is  easy,  to  verify  (5.16)  it  suffices  to  show,  for  each  /  E  that  both  sides 
have  the  same  inner  product  with  /.  Now, 

{Tfk,  f)  =  (kf,  T^f)  =  {ko<pf)=W)m= W),  ik.  /). 

proving  (5.16),  Hence,  Sarason’s  reformulation  of  (PN)  is: 

Given  are  distinct  points  Zi^ . . . ,  ZninlD  and  complex  numbers 


Wu..,,Wn 

Find  necessary  and  sufficient  conditions  that  there  exist  a  linear  operator  T  on  such  that 

(5.17)  ||T||  ^  1 


(5.18)  T  commutes  with  the  shift 
and 

(5.19)  T*k,.  =  Wjk,.  (j  =  1, 2, . . . ,  n). 

In  view  of  our  preceding  discussion,  it  is  clear  that  this  problem  is  completely  equivalent 
to  (PN).  Actually,  it  is  slightly  more  convenient  to  replace  T  by  its  adjoint,  and  then  (denoting 
T*  by  S)  the  above  conditions  become 


ll^ll  <  1 

S  commutes  with  the  backward  shift  M*  =  on 
Sk^.=wjk,.  (i  =  l,2,...,n). 

Now,  let  N  denote  the  span  of  the  vectors  k^.  (j  1 , 2, . . . ,  n).  If  5  exists  satisfying  5,  5 


and  5  then  *9 AT  c  so  the  norm  of  5'  |  A  is  at  most  one,  that  is 


(5.20) 


j-l 


3=1 
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holds  for  all  complex  n-tuples  {ij}.  Expanding  the  squares  and  using  the  identity  = 

kz.{zj)  =  1/(1  —  ZiZj),  and  a  little  manipulation,  one  sees  that  (5.20)  is  equivalent  to  the 
non-negative  definite  character  of  the  matrix  (5.2).  This  already  shows  the  necessity  of  that 
condition  for  (PN),  and  now  we  turn  to  the  more  difficult  issue  of  sufficiency.  So,  suppose 
(5.2)  is  non-negative  definite,  which  is  equivalent  to  (5.20),  i.e.  to  the  assertion 

(5.21)  5  I  iV  has  norm  at  most  one. 

Thus,  S  \  N  is  a  contractive  operator  on  N,  and  it  commutes  with  the  restriction  to  N  of  the 
backward  shift  (indeed,  each  operator  admits  all  the  k^.  as  eigenvectors). 

To  finish  the  proof  it  suffices  to  show: 

(E)  5  I  has  an  extension  to  that  is  a  contraction,  and  commutes  with  T^.  ^ 

Indeed,  if  (E)  is  proven,  and  the  extension  is  denoted  5,  then  S  commutes  with  so  (5)* 
commutes  with  (T^)*  —  and  hence  is  of  the  form  Tj  for  some  /  in  by  Lemma  5.2. 

Since  Tj  (which  is  simply  multiplication  by  /)  is  a  contraction,  ||/|loo  ^  1-  Thus,  S  =  Tf 
and  so  (using  5): 

Wjkj  =  Skj  =  Tjkj  =  f{zj)kj 

so  f{zj)  —  Wj  {j  =  1,2, ...  ,n),  and  this  completes  the  proof. 

Sarason  was  aware  that  there  was  more  at  stake  here  than  just  another  solution  to  some 
classical  interpolation  problems.  To  grasp  what  this  is,  write  H  (a  general  Hilbert  space)  in 
place  of  and  consider  a  subspace  N  that  is  invariant  for  an  operator  A  on  H  (in  PN  the 
role  of  A  is  played  by  T^).  Let  now  S  e  L(N)  commute  with  A  \  N.  And  we  ask: 

(EG)  Under  these  conditions,  can  S  be  extended  to  all  of  H  as  a  linear  operator  with  the 
same  norm,  which  moreover  commutes  with  A? 

That  is,  at  bottom  the  fundament  of  Pick’s  theorem  is  something  very  general,  and  geo¬ 
metric:  an  extension  property  for  linear  operators  from  a  subspace  of  a  Hilbert  space  to  the 
whole  space  with  preservation  of  (i)  the  norm  and  (ii)  a  commutation  relation.  Now  exam¬ 
ples  show  that  this  is  not  always  possible.  But,  after  the  appearance  of  Sarason’s  paper,  Foias 
and  Sz.-Nagy,  using  dilation  theory,  succeeded  to  show  that  (EG)  has  an  affirmative  answer 
whenever  A  is  a  co-isometry  on  H,  that  is.  A*  is  an  isometry.  This  covers  the  case  discussed 
by  Sarason,  since  there  A*  is  the  shift.  (Sarason  was  able  to  prove  (E)  by  ad  hoc  methods;  the 
Foias-Sz.-Nagy  result  is  a  new  milestone,  and  opened  up  a  fast-developing  branch  of  operator 
theory.)  The  assertion  that  the  extension  asked  for  by  (EG)  exists  when  A  is  a  co-isometry 
is  sometimes  called  the  “commutant  lifting  theorem”  (CLT)  because,  in  terms  of  the  adjoint 
operators,  it  becomes  a  question  of  lifting  (rather  than  extending)  an  operator  commuting 
with  an  isometry.  This  in  turn  has  a  further  generalization  to  the  intertwining  lifting  theo¬ 
rem.  Also,  the  CLT  turns  out  to  be  intimately  connected  to  the  possibility  of  simultaneous 
unitary  dilation  of  two  commuting  operators  (Ando’s  theorem)  and  von-Neumann  inequali¬ 
ties  for  polynomial  functions  of  two  commuting  contractions.  For  all  this,  see  [FoFr],  [SzFo] 
and  [Sz]. 
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For  all  historical  background  concerning  (PN)  and  related  problems,  an  excellent  source 
is  [FrKi],  which  includes  reprints  of  the  fundamental  papers  of  Herglotz,  Schur,  Pick  and 
Nevanlinna,  as  well  as  a  masterful  historical  review  by  B.  Fritzsche  and  B.  Kirstein  and  a 
thorough  bibliography.  The  monograph  [DuFrKi]  is  devoted  to  matricial  generalizations. 
Other  valuable  references  will  be  given  in  our  Appendix. 

Al.  Acknowledgements.  It  is  a  pleasure  to  thank  Bemd  Kirstein,  Karim  Kellay,  and  Mihai 
Putinar  for  valuable  advice  concerning  the  selection  of  material,  and  the  literature.  I  regret 
that  lack  of  space  has  made  it  unfeasible  to  take  up  some  very  important  topics  that  relate 
to  the  current  material,  especially  Hankel  and  Toeplitz  operators.  (See  [Sa2],  [Pol],  [Ni] 
and  [BoeKa]  for  this  material,  and  references  to  other  relevant  literature).  I  also  express  my 
gratitude  to  Siv  Sandvik,  who  produced  this  fine  I^^manuscript  from  my  hieroglyphics, 
under  time  pressure. 

A.2.  Suggested  reading.  The  reader  who  wishes  to  learn  more  about  the  material  we  have 
discussed  is  referred  to  the  following  literature,  which  supplements  the  sources  we  already 
have  named. 

Stationary  random  sequences,  prediction  theory:  [Lam,  Ya,HeSz,HeSa,Sa3],  [Ho,  Chapter  4]. 

Operator-theoretic  approaches  to  interpolation  problems.  Here  we  have  barely  scratched  the 
surface.  One  alternative  approach,  building  on  the  theory  of  Krein  spaces  (a  generalization 
of  Hilbert  spaces,  where  the  inner  product  is  given  by  an  indefinite  Hermitian  form)  to  (PN) 
and  related  problems  is  due  to  Ball  and  Helton.  A  very  nice  introduction  to  this  theory 
is  Sarason’s  lectures  in  the  volume  [Po2].  Also  recommended  from  the  same  volume  is 
N.K.  Nikolskii’s  lectures  on  Hankel  and  Toeplitz  operators,  where  the  pace  is  more  leisurely 
than  in  the  comprehensive,  standard  monograph  [Ni].  See  also  N.J.  Young’s  article  in  [Po2]. 
The  monograph  [Pol]  is  also  recommended,  and  [Sa2]. 

An  important  topic  we  have  not  mentioned  is  the  class  of  Hilbert  spaces  of  entire  func¬ 
tions  due  to  de  Branges  [Br],  which  are  becoming  increasingly  popular  as  new  applications 
and  connections  are  discovered  — for  instance,  at  the  time  of  this  writing  to  “frames”  and 
sampling  theory  by  J.  Ortega-Cerda  and  K.  Seip. 

The  collection  [Pe]  is  a  gold  mine.  It  contains  an  excellent  introduction  by  Sarason  to  in¬ 
variant  subspaces,  as  well  as  A.L.  Shields’  survey  of  weighted  shift  operators,  and  R.G.  Dou¬ 
glas’  survey  article  on  canonical  models  for  operators,  giving  the  “state  of  the  art”  of  dilation 
theory  and  its  ramifications  as  of  1970.  The  conference  volume  [Lan]  contains  an  article  of 
Sarason  wherein  he  describes,  with  his  usual  clarity,  the  relation  of  moment  theorems  to  op¬ 
erator  theory  in  Hilbert  space.  On  this  score  see  also  the  monograph  [RoRo]  which  gives  a 
compact  and  unified  framework  for  the  application  of  Hilbert  space  methods  (especially  corn- 
mutant  lifting)  to  problems  of  interpolation  both  in  D  and  D,  moment  theorems,  Loewner’s 
theorem  on  monotone  matrix  functions,  and  more.  For  interpolation  of  vector  and  matrix 
valued  functions,  see  [DuFrKi],  [FoFr]  and  [BaGoRo]. 
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Fairly  recently,  theorems  of  Pick-Nevanlinna  type  in  two  or  more  variables  have  started 
to  appear,  a  pioneering  role  having  been  played  by  Jim  Agler.  See  [AgMc]  for  an  account 
up  to  1997.  A  spectacular  application  of  operator  theory  to  “hyperbolic  geometry”  was  given 
in  1990  by  Agler  [Ag],  who  proved,  using  dilation  techniques,  the  celebrated  theorem  of 
L.  Lempert,  that  the  Caratheodory  and  Kobayashi  metrics  agree  on  bounded  convex  domains. 
(To  find  out  what  those  terms  mean,  the  reader  may  consult  the  nice  introductory  book  [Kr]). 

A  lively  interest  in  generalizations  of  Beurling’s  theorem  as  well  as  of  (PN),  often  in 
combination,  and  using  tools  from  dilation  theory,  persists  up  to  the  present  day.  See,  for 
example  [Qu],  [McTr]  and  [GrRiSu].  This  work  has  been  catalyzed  in  part  also  by  new  vistas 
and  rich  structure  encountered  in  the  recent  work  on  Bergmann  spaces,  starting  with  the 
ground  breaking  paper  of  Hedenmalm  [He].  We  refer  the  reader  to  a  forthcoming  monograph 
by  Hedenmalm  et  al.  The  current  work  [AlRiSu2],  with  a  wealth  of  interesting  results  for 
classical  analysts  as  well  as  a  good  bibliography,  is  also  recommended. 

Applications  of  Toeplitz  matrices  and  operators  in  science  and  engineering  are  many.  An 
interesting  variant  of  the  Caratheodory-Fejer  problem  important  for  engineering  applications 
due  to  T.  Georgiou  is  [Ge]  and  was  followed  up  in  later  work,  see  [ByLi]  for  a  recent  account. 
Those  are  also  beautiful  applications  to  statistical  physics,  a  good  survey  article  is  [Boe]. 
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Abstract.  This  is  an  expository  paper,  with  an  emphasis  on  the  history  of  the  sub¬ 
ject.  It  consists  of  three  main  parts: 

history,  terminology,  and  examples  (Sections  1  and  2), 
functions  and  series  (Sections  3, 4,  and  5), 
thin  sets  (Sections  6, 7,  8,  9,  and  10). 

I  chose  the  topics  and  references  according  to  my  own  interest,  but  I  decided  not  to 
develop  what  I  wrote  recently  elsewhere  [28],  so  that  I  was  very  brief  at  the  end,  in 
particular,  on  topics  I  like  best. 


1.  Lebesgue,  Baire,  and  trigonometric  series 

Lebesgue’s  measure  theory  and  Baire’s  category  theorem  are  both  a  hundred  years  old 
[1,2,43,48].  From  the  very  beginning,  they  were  linked  with  trigonometric  series.  Baire  was 
inspired  by  series  of  continuous  functions:  His  book  “Le9ons  sur  les  fonctions  discontinues” 
begins  with  the  example  of  functions  defined  as  sums  of  everywhere  convergent  trigonometric 
series,  the  main  object  of  Riemann’s  thesis  on  trigonometric  series  [3,53].  The  theory  of  the 
Lebesgue  integral,  as  we  now  know  it,  was  first  presented  in  Lebesgue’s  book  “Lemons  sur 
les  series  trigonomdtriques”  [44].  Both  books  resulted  from  courses  given  at  the  College  de 
France.  Baire  and  Lebesgue  had  been  elected  “Peccot  lecturers,”  which  was  an  opportunity 
given  to  young  mathematicians  to  address  younger  students.  In  fact,  Denjoy  attended  Baire’s 
lectures  and  wrote  part  of  Baire’s  book  (1905),  and  Fatou  attended  Lebesgue’s  lectures  and 
contributed  to  Lebesgue’s  book.  Fatou  wrote  his  thesis,  “Series  trigonometriques  et  series  de 
Taylor,”  at  the  same  time  (1906)  [14]. 
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The  relation  between  harmonic  analysis  and  integration  theory  is  completely  natural, 
since  the  first  instance  of  harmonic  analysis  is  the  computation  of  Fourier  coefficients  us¬ 
ing  the  Fourier  formulas,  and  these  formulas  involve  an  integral.  The  integral  in  question 
was  not  the  same  for  Dirichlet,  Riemann,  Lebesgue,  and  Denjoy,  but  it  was  either  applied 
(in  the  case  of  Dirichlet  and  Lebesgue)  or  designed  (in  the  case  of  Riemann  and  Denjoy) 
for  the  specific  purpose  of  giving  a  sense  to  the  Fourier  formulas.  In  particular,  the  second 
Denjoy  totalization  was  invented  specifically  to  compute  the  Fourier  coefficients  of  functions 
considered  by  Riemann,  namely,  the  sums  of  everywhere  convergent  trigonometric  series. 
Riemann  stated,  without  giving  a  proof,  that  the  coefficients  must  to  tend  to  zero.  Cantor 
proved  this,  and  moreover,  he  proved  that  the  coefficients  are  well  defined  when  the  function 
is  given.  (This  is  Cantor’s  uniqueness  theorem.)  Denjoy,  using  the  Lebesgue  integral,  the 
Baire  category  theorem,  and  the  transfinite  induction  of  Cantor,  provided  a  way  to  compute 
these  coefficients.  This  work  was  sketched  in  a  Comptes-rendus  note  in  1921  and  was  later 
developed  in  a  series  of  books  between  1941  and  1949  [9, 10]. 

This  Denjoy  integral,  called  “the  second  totalization”  (the  first  was  intended  to  compute 
the  primitive  of  the  most  general  derivative  [8]),  is  worth  mentioning  here  because  it  involves 
both  the  Lebesgue  integral,  which  is  the  prototype  of  a  probability  measure,  and  the  Baire 
category  theorem.  Furthermore,  it  performs  the  most  elementary  step  of  harmonic  analysis 
perfectly.  However,  the  Denjoy  integral  plays  no  role  in  the  rest  of  the  paper.  What  is  needed 
from  now  on  is  Lebesgue  measure  on  [0, 1],  or  any  equivalent  notion  of  a  probability  measure, 
and  the  Baire  theorem  expressed  as  follows: 

Baire’s  category  theorem.  Let  X  be  a  complete  metric  space,  and  let  (7„,  n  =  1, 2, . . .,  be 
a  sequence  of  dense,  open  subsets  of  X.  Then  the  intersection  of  the  Gn,  is  dense 

in  X.  (n^iGn  is  a  Gs  set.^) 

Note  that  Baire  could  not  express  his  theorem  in  these  terms;  the  notions  of  metric 
space  and  complete  metric  space  appeared  later.  They  appear  at  the  very  beginning  of  Ba¬ 
nach’s  book,  “Thdorie  des  operations  lin^aires”  (1932),  and  they  are  followed  immediately 
by  Baire’s  theorem  [4].  The  Polish  mathematicians  of  that  time  were  experts  in  using  Baire’s 
theorem  in  a  variety  of  situations.  It  is  used  in  Banach’s  book  to  prove  the  Banach-Steinhaus 
theorem,  which  was  a  new  way  to  look  at  the  old  “principle  of  condensation  of  singulari¬ 
ties.”  In  turn,  the  Banach-Steinhaus  theorem  is  used  to  show  that  there  exists  a  continuous 
function  whose  Fourier  series  diverges  at  a  point,  the  du  Bois-Reymond  phenomenon  [11]. 
I  discovered  the  power  of  Baire’s  theorem  in  harmonic  analysis  when  Katznelson  used  it  to 
prove  that  only  analytic  functions  operate  on  the  Wiener  algebra,  A(T)  —  the  space 

of  continuous  functions  on  T  whose  Fourier  series  converge  absolutely.  This  result  is  the 
converse  of  the  Wiener-Levy  theorem,  which  asserts  that  analytic  functions  do  operate  on 
the  algebra  A(T)  [34]. 

^  Any  countable  intersection  of  open  sets  is  called  a  Gs  set,  and  any  countable  union  of  closed  sets  is  called 
an  Fs  set 
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I  shall  not  develop  the  complete  role  of  measure  and  topology  in  harmonic  analysis,  but 
rather  consider  how  they  are  involved  in  several  specific  questions. 

2.  Almost  sure  and  quasi  sure;  examples 

In  the  terminology  of  Baire,  the  countable  unions  of  nowhere  dense  sets  are  called  sets 
of  the  first  category.  In  Bourbaki’s  terminology,  they  are  called  meager  sets.  They  are  the 
analogues  of  null  sets  in  measure  theory. 

The  complements  of  sets  of  the  first  category  (meager  sets)  are  called  sets  of  the  second 
category.  They  are  the  analogues  of  sets  of  full  measure,  and  they  can  be  defined  as  sets  that 
contain  a  dense  Gg  set.  When  a  property  holds  on  a  dense  Gg  set  it  is  called  generic  or  quasi 
sure.  This  is  the  analogue  of  almost  sure  in  probability  theory.  We  also  say  that  the  property 
holds  quasi  everywhere,  the  analogue  of  almost  everywhere,  or  that  it  is  enjoyed  by  quasi  all 
points,  which  is  the  analogue  of  almost  all.  The  abbreviations  q.s.  and  q.e.  are  parallel  to  a.s. 
and  a.e. 

The  analogy  goes  further.  There  is  a  theorem  by  Kuratowski  and  Ulam  that  is  analogous 
to  Fubini’s  theorem,  and  it  is  convenient  for  us  to  state  it  in  the  following  form: 

The  Kuratowski-Ulam  theorem  [28].  Let  X  and  Y  be  Baire  spaces  (for  example,  complete 
metric  spaces).  Suppose,  moreover,  that  Y  is  separable,  that  is,  there  is  a  countable  base  of 
open  sets.  Let  A  be  a  dense  Gg  set  in  X  xY.  For  each  x  e  X,  define  A{x,  •)  —  {y  €  F  | 
(x,  y)  Ei  A}.  Then  A(a;,  •)  is  a  dense  Gg  set  in  Y  for  quasi  all  a;  G  X. 

It  is  important  to  realize  that  almost  sure  and  quasi  sure  properties  can  be  very  different, 
that  null  sets  can  be  sets  of  the  second  category,  and  that  sets  of  full  measure  can  be  meager 
sets.  Here  are  a  few  examples. 

1.  Let  X  —  {-1,1}^.  X  is  both  a  complete  metric  space  and  a  probability  space,  when 
equipped  with  the  usual  probability.  Let  (c„),  n  G  N,  be  a  positive  sequence,  and  consider 
the  series  ±c„  indexed  by  (di)  G  X.  Then 

^2  converges  a.s.  ^  <  oo, 

^2  converges  q.s.  ^  c„  <  oo. 

The  first  result  is  a  theorem  of  Rademacher,  Khintchin,  and  Kolmogorov.  The  second  is  an 
easy  exercise:  When  J]]  c„  =  oo,  the  subset  K  of  X  on  which  some  partial  sum  of  each  series 

±c„,  (±)  G  Y,  exceeds  a  given  number  N,  is  open  and  dense.  Therefore  the  series  is 
divergent  on  a  dense  Gg  set  of  X.  In  other  words,  the  series  Y  diverges  q.s.  as  soon  as 
any  one  of  them  diverges. 

2.  Let  X  =  {—1, 1}^  as  before,  but  now  consider  the  Dirichlet  series 

s  =  a  +  it  These  series  converge  when  a  >  1.  We  write  f{s)  either  for  their  sum  when 
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(j  >  1  or  for  their  analytic  continuation.  Choosing  (±)  G  X,  the  natural  boundary  of  f(s)  is 
the  line  cr  =  1  q.s.  and  the  line  a  =  1/2  a.s. 

It  is  more  interesting  to  consider  i((2n  -  1)"'®  -  (2n)”*),  with  (±)  e  X.  Then 
the  natural  boundary  of  the  corresponding  function,  which  we  denote  by  /i(s),  is  cr  =  0  q.s. 
and  a  =  -1/2  a.s.  Furthermore,  the  order  of  /i(5), 

/x(cr)  =  {inf  a  |  /(cr  +  it)  =  0(|t|“),  |^|  oo}, 

is  (1  -  cr)'*’  =  max{0, 1  -  cr}  on  (0,oo)  q.s.  and  (1/2  -  cr)+  a.s.  If  the  difference  (2n  - 
l)-5  _  (2n)"^  is  replaced  by  the  differences  of  higher  orders, 

((4n  -  3)-^  -  (4n  -  2)“^)  -  ((4n  -  1)^  -  (4n)-), 

((8n  -  7)“^  -  (8n  -  G)"")) - +  ((8n  -  1)"^  -  (8^)"®), 

and  so  on,  we  obtain  functions  /2(s),  his), . . .,  that  are  defined  on  larger  and  larger  domains. 
The  functions  /i(s),  72(5),  h(s),. . .  are  all  of  the  form  ±n~^  for  cr  >  1,  although  the 
domain  of  the  coefficients  (±)  becomes  restricted  as  the  process  continues.  Furthermore, 
/i(cr)  =  (1  —  cr)+  q.s.  and  /j,{a)  =  (1/2  —  cr)"^  a.s.  on  R  where  these  functions  are  defined. 
By  taking  successive  blocks  of  terms  from  the  series  representing  the  functions  /i(s),  f2{s), 
/3(s), . . .,  it  is  possible  to  define  an  entire  function  /oo(s)  that  it  is  represented  by  a  series 
of  the  form  ±n~^  when  cr  >  1,  and  such  that  //(cr)  —  (1  —  cr)+  q.s.  and  //(cr)  = 
(1/2  —  cr)'*'  a.s.  on  (—00, 00).  Many  variations  are  possible,  and  the  quasi-sure  constructions 
play  an  important  role  in  the  study  of  convergence  and  summability  properties  of  products  of 
Dirichlet  series  [23,32,51]. 

3.  Let  X  =  T  and  consider  the  Hardy -Weierstrass  function 

00 

f(t)  —  y^2"”  cos(27r2”^). 


Geza  Freud  made  a  careful  study  of  this  function.  First,  as  already  noticed  by  Hardy,  it  is 
nowhere  differentiable.  However,  Af{t)  =  f(t  +  h)  —  f{t)  is  0{\h\)  as  h  0  for  some  i, 
called  slow  points.  Actually,  the  set  of  slow  points  is  both  a  null  set  and  a  meager  set,  but 
it  has  Hausdorff  dimension  1.  The  modulus  of  continuity  of  /  is  0(/ilog  ^),  meaning  that 
lA/(i)|  <  (7|/i|  log  for  all  t.  However,  there  are  rapid  points  t  such  that 


\m)\ 

t.^0  Hlog^ 


>0, 


and  indeed,  quasi  all  t  are  rapid  points.  On  the  other  hand, 


0  <  lim 


|A/(t)| 

l'‘l(loglSllog>08logj^)'^'^ 


<  OO 


for  almost  all  t. 


J.-P.  Kahane  /Probabilities  and  Baire’s  theory  in  harmonic  analysis 


61 


We  see  from  these  examples  that  the  Baire  approach  emphasizes  divergence,  singularities, 
and  large  values,  while  the  probabilistic  approach  favors  convergence,  smoothing,  and  regu¬ 
larizing  effects.  We  shall  encounter  these  ideas  throughout  the  paper.  However,  the  first  effect 
of  both  approaches  is  to  convert  rather  strange  phenomena  into  familiar  ones,  to  tame  mon¬ 
sters,  or  to  generate  monsters  in  a  familiar  way.  I  shall  illustrate  this  aspect  by  considering 
nondifferentiable  continuous  functions  and  noncontinuable  Taylor  series,  before  considering 
other  questions. 


3.  Nowhere-differentiable  functions 

Nowhere-differentiable  continuous  functions  and  noncontinuable  Taylor  series  (i.e.,  ana¬ 
lytic  functions  whose  domain  of  existence  is  a  disc)  were  both  discovered  by  Weierstrass.  I 
remind  you  of  what  Hermite  said  about  nowhere-differentiable  functions  in  a  letter  to  Stielt- 
jes:  “Je  me  detoume  avec  horreur  et  eflfroi  de  cette  plaie  lamentable  des  fonctions  continue 
qui  n’ont  pas  de  derivee.” 

Nevertheless,  such  functions  attracted  the  attention  of  the  physicist  Jean  Perrin  when  he 
observed  the  trajectories  of  Brownian  particles,  and  they  became  part  of  Wiener’s  program  to 
build  a  mathematical  theory  of  Brownian  motion,  where  the  realizations  are  a.s.  continuous 
and  nowhere  differentiable.  Wiener  quoted  Jean  Perrin  several  times,  and  his  program  was 
only  achieved  in  1933  with  the  help  of  Paley  and  Zygmund.  Nowadays  the  local  behavior 
of  Brownian  motion — replete  with  the  concepts  of  a  strong  form  of  nowhere  differentiability, 
modulus  of  continuity,  average  behavior  (law  of  the  iterated  logarithm),  rapid  points,  and 
slow  points — is  well  understood  (see  [24]  or  [29]  for  references). 

The  same  is  true  for  the  Baire  point  of  view.  Let  us  consider  C(/),  the  space  of  real 
continuous  functions  on  the  interval  /  =  [0, 1].  Given  any  sequence  0  and  an  integer 
I/,  the  set  of  /  G  C(/)  with  the  property  that 


.  3”>-«(=0,, . .-1), 

is  a  dense,  open  subset  of  C{I),  which  we  denote  by  If  /  G  then 


Tj-  \  f{t  +  h)- f{t)\ 
h^O  (p{\h\) 


=  oo 


for  all  t  G  [0, 1] 


whenever  is  an  increasing  function  such  that  (p{l/h)  =  o(An)  as  n  ->  oo.  This  is  a  very 
strong  form  of  nowhere  differentiability.  In  particular,  the  Holder  exponent  of  quasi  all  /  is  0 
at  every  point.  Quasi  surely  in  (7(7),  the  multifractal  analysis  is  trivial.  To  have  an  interesting 
multifractal  analysis  as  a  generic  phenomenon,  more  restricted  classes  of  functions  should  be 
considered,  as  Stephane  Jaffard  has  done  [17]. 
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4.  Random  Taylor  series:  continuation,  convergence,  and  divergence 

Weierstrass  used  a  lacunary  trigonometric  series  to  construct  a  continuous  nowhere-dif- 
ferentiable  function.  Later,  he  constructed  a  noncontinuable  Taylor  series  using  the  same 
idea.  Poincare  and  Hadamard  also  used  lacunary  Taylor  series  to  construct  noncontinuable 
functions.  Then,  in  1896,  Borel  issued  a  strange  statement,  namely,  that  in  general,  a  Taylor 
series  is  not  continuable  across  its  circle  of  convergence. 

Borel  had  in  mind  a  probabilistic  interpretation  of  this  statement.  He  spoke  of  arbitrary 
coefficients  (coefficients  quelconques)  and  clearly  thought  of  random,  independent  phases. 
In  fact,  he  stated  and  used  a  first  version  of  the  Borel-Cantelli  lemma  for  that  purpose.  But 
Borel’s  theory  of  countable  probabilities  was  not  yet  bom,  and  “in  general”  could  not  have  a 
precise  meaning  in  1896. 

Steinhaus  gave  a  rigorous  interpretation  of  probabilistic  concepts  by  means  of  Lebesgue 
measure  on  7  =  [0, 1].  Using  binary  expansions,  he  first  transferred  Lebesgue  measure  to  the 
space  {0, 1}^,  then  to  the  same  space  written  as  {0, 1}^,  and  finally  to  7^.  In  this  way  he 
obtained  the  so-called  Steinhaus  sequences  (a;„)  €  7^,  for  which  he  proved  a  zero-one  law. 
Then,  by  considering  random  series 

oo 

(S)  = 

n-yoo 

n=0 

it  was  not  difficult  to  prove  that  noncontinuation  holds  almost  surely  (1923,  1929,  [54,55]). 
Paley  and  Zygmund  proved  the  same  result  for  Rademacher  Taylor  series 

00 

(R)  Y^±anz’' 

n=0 

in  1932.  They  also  considered  random  trigonometric  series,  and  this  was  the  beginning  of 
the  theory  of  random  series  of  functions.  In  1933,  Wiener  joined  them  and  proposed  to 
study  Gaussian  series,  such  as  the  Wiener  Fourier  series  of  Brownian  motion,  in  parallel  with 
Steinhaus  or  Rademacher  trigonometric  series.  To  make  the  notation  simpler,  I  shall  consider 
only  Gaussian  Taylor  series 

CXD 

(G)  ^  ^  Cn^n^  5 

where  the  €  C  are  independent,  normalized  Gaussian  random  variables.  When  2;  = 

(S) ,  (R),  and  (G)  appear  as  random  trigonometric  series  with  only  positive  frequencies.  The 

history  of  these  series,  culminating  with  the  work  of  Marcus  and  Pisier,  can  be  found  in 
the  second  edition  of  my  book  “Some  Random  Series  of  Functions”  (1985)  [24].  Roughly 
speaking,  it  was  known  in  1933  that  the  series  (S),  (R),  and  (G)  represent  a.s.  functions  in 
77\  77^,  and  77^  with  p  <  00,  if  and  only  if  <  00.  Marcus  and  Pisier  established 

that  uniform  convergence  in  the  closed  unit  disc  has  the  same  probability,  0  or  1,  for  series 
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(S),  (R),  and  (G),  and  in  this  way  reduced  the  study  of  uniform  convergence  of  Rademacher 
Taylor  series  to  the  same,  but  more  tractable,  problem  involving  Gaussian  processes. 

Nevertheless,  not  all  results  are  the  same  for  the  series  (S),  (R),  and  (G).  An  interesting 
example  is  the  problem  of  divergence  everywhere  at  the  boundary  of  the  unit  disc,  which  was 
considered  by  Dvoretzky  and  Erdos  [23].  The  condition 


(*) 


lim  - - Idi  a2 CLn\  >  0 

n-voo  log  n 


implies  that  each  of  the  series  (S),  (R),  and  (G)  diverges  everywhere  on  the  circle  =  1 
[24].  But  there  are  series  (R)  that  converge  somewhere,  while  the  corresponding  (G)  diverges 
everywhere.  It  suffices  to  choose 


oo  oo 

(R)  :  and  (G)  :  , 

j=0  j=0 

where  gj  =  ^4?,  aj  =  o(l),  limj-^ooQJjV^og  j  =  00,  and  —  00  a.s.  This 

example  also  shows  that  logn  cannot  be  replaced  by  any  o(logn)  in  condition  (*).  (See  the 
Paley-M.  Weiss  theorem  in  Section  7.) 

Divergence  everywhere  raises  apparently  difficult  questions.  Is  it  true  that  the  probability 
of  everywhere  divergence  increases  when  we  increase  the  moduli  of  the  coefficients?  The 
corresponding  questions  for  convergence  everywhere,  convergence  almost  everywhere,  and 
divergence  almost  everywhere  (decreasing  or  increasing  the  moduli  of  the  coefficients,  ac¬ 
cording  to  the  case)  have  positive  answers.  The  question  can  be  considered  for  other  series 
of  functions,  for  example,  for  random  trigonometric  series  or  for  random  Walsh  series. 

Other  problems  arise  when  we  consider  the  properties  of  analytic  functions  defined  by 
the  series  (R),  (S),  or  (G).  Does  their  range  cover  the  plane  C  and  by  how  much?  The  answer 
is  fairly  precise  for  (G),  but  it  is  still  incomplete  for  (R)  and  (S)  (references  can  be  found 
in  [27],  p.  267). 


5.  Generic  trigonometric  and  power  series 

Let  me  turn  to  another  interpretation  of  Borel’s  statement  on  the  noncontinuation  of  Tay¬ 
lor  series  as  a  generic  phenomenon.  Let  H(D)  denote  the  space  of  analytic  functions  defined 
on  the  unit  disc  D  =  {z  |  \z\  <  1}  and  endowed  with  the  topology  of  uniform  convergence 
on  compact  subsets  of  D.  Let  A  be  a  complete  metric  space  consisting  of  analytic  functions 
defined  on  D  such  that  the  mappings  /  /(”)  [z)  are  continuous  from  X  to  C  whenever 
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n  6  N  and  z  ^  D.  Given  a  €  r  >  0,  and  >1  €  N,  let 

OO  - 

G{a,r,A)  =  {f  e  X  |  E 

n=0 

00  H 

J{a,T)  =  fl  G{a,r,A)  =  {f  e  X  |  ^  =  00} . 

yl>0  n=0 

The  G(a,  r,  A)  are  open,  and  J{a,  r)  is  a  G5  set.  Define 

J  =  n7(a,r), 

a,r 

where  the  coordinates  of  a  and  r  are  rational  and  r  >  1  —  |a|.  Then  J  is  also  a  Gs  set,  and 
the  functions  /  belonging  to  J  are  not  continuable  across  the  circle  \z\  =  1. 

It  follows  that  quasi  all  /  in  X  are  noncontinuable  as  soon  as,  given  any  domain  Q  strictly 
larger  than  D,  the  set  of  /  that  cannot  be  continued  analytically  on  O  is  dense  in  X.  In  par¬ 
ticular,  functions  represented  by  (R),  (S),  or  (G),  as  well  as  functions  belonging  to  H{D), 
are  quasi  surely  not  continuable  across  the  circle.  This  result  is  due  to  Kierst  and  Szpilrajn 
for  H{D)  [39],  and  it  can  be  extended  in  a  number  of  ways,  by  replacing  D  with  any  other 
domain  in  C  or  C^,  in  the  following  form:  Either  all  functions  belonging  to  the  space  under 
consideration  can  be  extended  in  the  same  larger  domain,  or  quasi  all  functions  are  noncon¬ 
tinuable  [28]. 

Article  [28]  contains  many  generic  properties  of  trigonometric  and  Taylor  series.  Let  me 
quote  a  few  of  them. 

1.  X  =  C(T).  The  partial  sums  S'„(/,  t)  of  the  Fourier  series  of  quasi  all  /  satisfy 

hm  (/,<)  —  00  quasi  everywhere 

n-j-cjo  Un 

whenever  0  <  Wn  =  o(logn)  as  n  ^  00. 

2.  X  L^(T).  The  partial  sums  5„(/,i)  of  quasi  all  /  satisfy 

him  -  00  everywhere 

n^oo  An 

whenever  0  <  An  =  as  n  00.  (This  is  the  generic  Konyagin  theorem  [40].) 

3.  X  =  H{D).  Given  any  open  subset  of  D,  say  A,  such  that  the  boundaries  of  D  and  A 
have  at  least  one  point  in  common,  quasi  all  /  satisfy 

/(e^^A)  C  for  all  6  eR. 

4.  X  =  H(D).  The  Taylor  series  of  quasi  all  /  satisfy  the  Nestoridis  universality  prop¬ 
erty:  Given  any  compact  set  K  in  {z  \  \z\  >  1}  such  that  C  \  X  is  connected,  and  given 
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any  function  F  that  is  continuous  on  K  and  analytic  in  its  interior,  there  exists  a  sequence  of 
partial  sums  that  converge  to  F  uniformly  on  K  [47].  As  a  corollary,  one  can  show  that  every 
continuous  function  on  {z  \  |>2:|  =  1}  is  a  pointwise  limit  of  some  sequence  of  partial  sums. 

5.  X  —  Co(N).  Quasi  all  Fourier  Taylor  series  (on)  6  X,  have  the 

Menchoff  universality  property:  Every  Lebesgue-measurable  function  is  a  limit  of 

some  sequence  of  partial  sums  almost  everywhere  [31]. 


6.  Thin  sets  and  function  spaces 


The  last  part  of  this  paper  concerns  thin  sets  in  Z  and  T.  “Thin”  refers  to  properties  related 
to  trigonometric  series.  Furthermore,  the  thin  sets  we  consider  in  Z  are  lacunary,  and  the  thin 
sets  we  consider  in  T  are  closed  and  have  zero  Lebesgue  measure. 

Probability  methods  have  proven  to  be  quite  efficient  for  the  study  of  thin  sets  in  Z, 
however,  so  far,  I  know  of  no  use  of  the  Baire  theory  in  this  context.  On  the  other  hand,  both 
probability  methods  and  Baire’s  method  are  actively  used  for  exhibiting  properties  of  thin 
sets  in  T.  As  one  would  expect,  the  results  obtained  by  the  two  methods  are  quite  different 
and  often  go  in  opposite  directions.  A  way  to  relate  them  was  discovered  by  Komer;  it  will 
be  sketched  at  the  end  of  the  paper. 

The  first  use  of  thin  sets  of  integers  was  made  by  Hadamard  for  studying  lacunary  Tay¬ 
lor  series.  Assuming  that  (An)  is  an  increasing  sequence  of  positive  integers  such  that  (*) 
An+i/An  >  9  >  1  for  all  n,  he  showed  that  all  series  with  a  finite,  nonzero 

radius  of  convergence  are  noncontinuable.  Condition  (*)  is  far  from  being  the  best  for  this 
kind  of  result.  However,  (*)  plays  a  role  in  a  number  of  questions  in  harmonic  analysis.  It  is 
called  the  Hadamard  lacunary  condition. 

Let  me  proceed  with  the  definitions.  C'(T),  Lp(T)  (1  <  p  <  oo),  co(Z),  and  /^(Z) 
(1  <  p  <  oo)  have  the  usual  meaning.  C'as(T)  is  the  subspace  of  L^(T)  consisting  of 
functions  /  ~  such  that  X)nGZ  represents  a.s.  a  continuous  func¬ 

tion.  Then,  according  to  Marcus  and  Pisier,  the  same  is  true  for  the  Gaussian  Fourier  series 
where  the  Cn  are  i.i.d.  normalized  Gaussian  variables,  and  Cas(T)  is  a  Ba¬ 
nach  space  using  either  of  the  equivalent  norms 


E 


nez 


,27rint 


C(T) 


or  E 


EC"/"' 

nez 


,27rinf 


C(T) 


The  Pisier  algebra  is  C(T)  D  Ca5(T).  A(T)  =  Fl^{Z)  is  the  Wiener  algebra,  which  is  the 
subspace  of  those  functions  in  C(T)  whose  Fourier  series  converge  absolutely  (see  Section 
1).  The  space  of  Radon  measures  on  T  is  denoted  by  M (T),  and  the  cone  of  probability  mea¬ 
sures  is  denoted  by  Mi{T).  The  pseudomeasures  are  linear  forms  on  A(T),  in  other  words, 
Schwartz  distributions  on  T  whose  Fourier  coefficients  are  bounded.  The  space  of  pseu¬ 
domeasures,  is  denoted  by  FM(T).  Pseudofunctions  are  pseudomeasures  whose 
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Fourier  coefficients  tend  to  0.  The  space  of  pseudofunctions,  ^co(Z),  is  denoted  by  PF{T). 
The  Rajchman  measures  are  at  the  same  time  measures  and  pseudofunctions.  We  write 

Mo(T)  -  M(T)  n  PF(r)  =  {fie  M(T)  I  fi(n)  =  o(l),  n  ->  ±oo}. 

We  shall  also  be  interested  in  the  classes  0  ^  a  ^  1,  defined  as 

M„(T)  =  {ne  M(T)  I  /i(n)  =  o{\n\-°l%  n  ±oo}. 

7.  Sidon  and  Zygmund  sets 

Given  any  family  F{T)  of  functions  or  distributions  defined  on  T  and  any  subset  A  of 
Z,  we  write  F\  for  the  subfamily  of  F{T)  whose  elements  have  their  spectrum  in  A.  Thus, 
/  €  Fa  if  and  only  if 

Here  is  the  first  and  most  important  definition  of  a  thin  set  of  integers. 

Dl.  A  is  a  Sidon  set  means  that  C^  —  Aa. 

Equivalent  definitions,  known  already  in  the  1930s,  are  M|a  =  FM|a  and  |a  =  FF|a. 
(Here  we  write  F|a  for  the  space  of  restrictions  to  A  of  Fourier  transforms  of  elements  of 
F(T);  then  FM|a  =  /°°(A)  and  PF|a  =  Co(A).)  The  structure  of  Sidon  sets  is  not  yet 
clarified.  Probability  methods  have  helped,  and  there  have  been  two  approaches:  1)  the  use 
of  random  functions  with  a  given  spectrum,  and  2)  the  use  of  random  sets  of  integers. 

The  first  method  was  introduced  in  1957  [18, 19]  and  gave  the  following  result:  If  A  is  a 
Sidon  set,  there  exists  a  AT  >  0  such  that  given  positive  integers  n  and  s,  every  “net”  of  the 
form 

n 

mj  6  Z.El'^jl  <  2*l}, 

j  =  l 

where  the  aj  are  real,  contains  no  more  than  Kns  points  of  A.  It  is  not  known  whether  this 
necessary  condition  is  also  sufficient.  The  triumph  of  the  method  is  an  alternative  definition 
of  Sidon  sets,  from  which  the  theorem  of  Drury — the  union  of  two  Sidon  sets  is  a  Sidon 
set — follows  immediately: 

D2.  A  is  a  Sidon  set  if  and  only  if  CasA  = 

This  is  due  to  D.  Rider  [52].  The  characterizations  given  by  Pisier  [49,50]  and  by  Borgain 
[6]  use  the  same  approach. 

The  second  method  was  introduced  by  Katznelson  and  Malliavin  in  1966  [36]  in  con¬ 
nection  with  the  “dichotomy  problem”:  For  a  rather  large  class  of  random  sets  of  integers, 
A  =  A(a;),  either  A  is  a  Sidon  set  or  only  analytic  functions  operate  on  L^|a.  Katznelson 
proved  the  following  result  in  1972  [35]:  In  the  second  case,  A  is  dense  in  the  Bohr  group, 


N{ai,  02,  ■  •  •  5  o,n\  5)  =  jaimi  F  a2m2  H - f- 
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that  is,  the  Bohr  compactification  of  Z.  Moreover,  he  indicated  a  way  to  construct  Kudin’s 
A{p)  sets  by  a  random  procedure.  (A  is  A(p)  means  that  L‘^\A{—  l‘^{A))  =  I?|a.)  This  was 
later  developed  by  Borgain  in  [7]  to  obtain  A(p)  sets  that  are  not  A{p  +  e).  Borgain’s  selector 
method  consists  in  choosing  A  as  the  support  of  the  random  measure  where  Sn 

is  the  Dirac  mass  at  n,  and  where  the  Xn  are  independent  random  variables  with  values  0  or 
1  such  that  EXn  —  for  some  given  sequence  a„,  0  <  <  1.  When  <2„  =  0(1/  log  |n|), 

n  dioo,  A  is  a.s.  a  Sidon  set.  When  ja^l  log  |n|  tends  to  oo,  there  is  a.s.  a  trigonometric 
series  sin  Xt  that  is  uniformly,  but  not  absolutely,  convergent.  New  developments 

are  given  in  [45]. 

It  is  through  definition  D2  that  Sidon  sets  are  related  to  almost  surely  everywhere  con¬ 
vergent  (or  equivalently,  uniformly  convergent)  trigonometric  series.  The  Zygmund  sets  (or 
sequences),  which  I  shall  define  below,  are  related  to  almost  surely  somewhere  convergent 
trigonometric  series,  which  we  have  already  considered.  They  originate  from  a  theorem  of 
Zygmund  [57]  and  a  theorem  of  Paley  and  Mary  Weiss  [56]  on  Hadamard  real  trigonometric 
series  and  Hadamard  Taylor  series  with  coefficients  tending  to  zero.  From  now  on,  A  will 
denote  a  set  of  positive  integers,  ordered  increasingly. 

D3.  A  is  a  Zygmund  set  whenever  every  real  trigonometric  series 

AGA 

with  ax  =  o(l),  A  00,  converges  at  some  point  t  eT. 

D4.  A  is  a  Zygmund"^  set  whenever  every  Taylor  series  X)aga  =  o(l), 

A  — >•  oo,  converges  at  some  2:,  \z\  =  1. 

Every  Zygmund"**  set  is  a  Zygmund  set.  It  is  not  known  if  the  converse  is  true.  Necessary 
conditions  for  A  to  be  a  Zygmund  set  were  given  by  Erdos  [13]  and  by  me  [21],  and  they  are 
close  to  the  necessary  conditions  known  for  Sidon  sets.  Is  every  Zygmund  set  a  Sidon  set?  Is 
every  Sidon  set  in  N  a  Zygmund  set?  These  are  old  questions,  hardly  considered. 

8.  Kronecker,  Helson,  M,  and  Salem  sets 

We  are  now  going  to  consider  closed  subsets  of  T.  I  shall  restrict  myself  to  Kronecker 
sets,  Helson  sets,  M  sets,  sets  with  0  <  a  <  1,  and  Salem  sets.  Here  are  the  definitions 
( [29,33]). 

D5.  E  (si  closed  subset  of  T)  is  a  Kronecker  set  if  each  function  of  modulus  1  that  is  con¬ 
tinuous  on  E  is  the  uniform  limit  of  some  sequence  of  imaginary  exponentials  exp(27rmji). 

From  now  on  I  shall  write  C{E)  for  the  Banach  space  of  continuous  functions  on  E  and 
A{E)  for  the  Banach  space  consisting  of  the  restrictions  to  E  of  functions  belonging  to  A(T). 

D6.  E  is  a  Helson  set  if  A{E)  —  C{E). 


68 


J.-P.  Kahane  /  Probabilities  and  Baire’s  theory  in  harmonic  analysis 


An  equivalent  definition  is  that  the  measure  norm  and  the  pseudomeasure  norm  are  equiv¬ 
alent  for  all  measures  //  e  M{E),  that  is,  measures  supported  by  E.  The  first  theorem  about 
Helson  sets,  due  to  Helson,  is  that  a  Helson  set  carries  no  nonzero  measure  belonging  to 
Mo(T)  [16].  A  long  standing  question,  whether  a  Helson  set  can  carry  a  pseudofunction, 
was  solved  positively  by  T.  Komer  in  1972  [41].  The  best  proof  uses  a  probability  device  of 
R.  Kaufman  [38]. 

D7.  is  an  M  set  if  it  carries  a  nonzero  pseudofunction,  or  in  our  notation,  if  PF{E)  / 

{0}. 

An  M  set  is  also  called  a  set  of  multiplicity  because  there  are  infinitely  many  trigono¬ 
metric  series  that  converge  to  0  outside  the  set.  The  opposite  is  called  a  set  of  uniqueness.  A 
Kronecker  set  is  both  a  Helson  set  and  a  set  of  uniqueness. 

D8.  E  is  an  Mq  set  if  it  carries  a  nonzero  measure  p,  such  that  ji{n)  —  o(l)  as  |n|  -)■  oo. 

Helson’s  theorem  says  that  an  Mq  set  cannot  be  a  Helson  set;  Komer’s  construction  shows 
that  an  M  set  can  be  a  Helson  set. 

D9.  E  is  an  set  if  Ma{E)  {0},  which  means  that  E  supports  a  probability  measure 
fi  such  that  fi{n)  =  o(|n|~“/^)  as  |n|  oo. 

It  follows  from  a  theorem  of  Frostman  that  the  Hausdorff  dimension  of  an  Ma  set  is  >  a. 

DIO.  jE^  is  a  Salem  set  of  dimension  o;  if  its  Hausdorff  dimension  is  a  and  if  E  is  an 
set  for  every  /?  <  a. 

9.  Random  thin  sets 

Probability  methods  have  been  used  to  obtain  Salem  sets  in  different  ways.  The  problem 
of  finding  Salem  sets  of  dimension  a  for  each  a  between  0  and  1  was  suggested  by  Beurling 
[33]. 

Salem’s  construction,  apart  from  probability  methods,  uses  Diophantine  approximation, 
and  it  produces  sets  that  have  strong  arithmetical  properties,  of  the  same  type  as  the  usual 
Cantor  sets.  This  construction  is  described  in  [33]. 

Processes  with  independent  increments,  in  particular  Levy  processes  and  Brownian  mo¬ 
tion,  provide  Salem  sets  in  a  most  natural  way:  The  images  of  a  fixed  set  of  dimension  a/2 
under  Brownian  motion  is  a.s.  a  Salem  set  [24].  There  is  an  analogue  of  this  result  for  Levy 
flights  [26,30,46].  The  first  result  in  this  direction  was  suggested  by  B.  Mandelbrot  when  he 
became  interested  in  the  “Levy  dust.”  Here  is  the  result.  Recall  that  any  stationary  increasing 
process  with  independent  increments,  a  Levy  flight,  is  a  mapping  A  :  E"''  x  D  — >  E"''  that 
satisfies  the  following  properties:  For  almost  all  a;,  X{t,uj)  is  an  increasing  function  of  t, 
it  is  continuous  to  the  right,  and  A(0,ci;)  =  0  for  all  u.  Furthermore,  for  each  choice  of 
0  ^  ^1  ^  ^2  ^  ^  the  random  variables  •)  —  X (tj,  •),j  —  1,  2, . . . , n  —  1,  are 
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independent  and  their  law  depends  only  on  tj+i  -  tj.  The  function”  associated  with  this 
process  is  defined  by  the  equation 

^  x{t)  =  x(t,  •), 

and  the  law  of  the  process  depends  only  on  ip.  Then  the  image  of  the  Lebesgue  measure  on 
[0,1]  under  X (t)  is  a  random  measure  //  whose  Fourier  transform  is 

A(u)  =  r  dt. 

Jo 


Writing 

one  gets  successively 


and 


h{u)  =  inf  Ke‘il){x), 

\x\^u 


a.s. 


The  last  steps  are  copied  from  Salem  [33].  The  definition  of  h{u)  as  inf  |'0(a;)|  given  in  [30] 
and  [26]  is  incorrect.  The  correction  is  due  to  A.  Benchdrif-Madani  (see  [5],  p.  87).  In 
particular,  for  stable  Levy  processes  of  index  a,  'ip{u)  =  where  c  is  complex,  and  the 
support  of  fjL,  transported  to  T,  is  a  Salem  set  a.s. 

It  can  be  expected  that  most  “naturally  occurring”  random  sets  are  Salem  sets.  In  partic¬ 
ular,  level  sets  of  fractional  Brownian  motions  should  be  Salem  sets.  This  is  true  for  ordinary 
Brownian  motion  because  the  level  set  starting  from  0  coincides  with  the  closure  of  the  range 
of  a  Levy  process  of  index  1/2.  The  measure  to  be  considered  is  now  (5(X(-)).  The  method 
is  at  hand  in  the  last  chapter  of  [24],  but  it  was  never  carried  out. 

Examples  of  sets  where  spectral  synthesis  fails  on  the  line  M  were  first  given  by  Malliavin. 
His  idea  was  to  construct  a  function  /  e  A(T)  and  a  pseudomeasure  defined  formally  as 
S'(f),  where  S'  is  the  derivative  of  the  Dirac  measure.  )^en  {S'{f),  f)  ^  0,  the  set  f~^(0) 
does  not  permit  spectral  synthesis.  Level  sets  of  specially  constructed  Gaussian  processes 
X{t)  have  this  property:  —  x)  is  a  pseudomeasure  that  cannot  be  approximated  in 

the  weak  topology  of  PM(T)  by  measures  carried  within  the  support  of  -  rc),  with 

a  positive  probability  that  depends  on  x.  It  was  noticed  that,  for  these  special  Gaussian 
processes,  this  pseudomeasure  belongs  to  PP  whenever  p  >  2  [20].  (The  statement  in  [20] 
is:  Whatever  p  >  2,  there  exists  a  process. . ..  It  is  easy  to  obtain:  There  exists  a  process 
such  that,  whatever  p  >  2, . . ..)  It  is  very  likely  that,  for  these  Gaussian  processes,  the 
pseudomeasure  belongs  to  Mq(T),  whatever  0  <  a  <  1/2  (perhaps  also  for  a  =  1/2),  but  it 
needs  some  work  to  prove  this. 
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10.  Generic  thin  sets 

I  shall  be  brief  on  Baire’s  methods — as  introduced  by  Robert  Kaufman  [37],  developed 
in  a  series  of  ways,  and  revived  recently  by  Thomas  Komer  [42] — because  the  matter  is 
treated  in  [28].  However,  it  is  worth  mentioning  the  power  and  versatility  of  Baire’s  methods 
for  obtaining  strange  closed  sets  on  R  or  T.  (Definitions  D5  to  DIO  extend  to  R,  so  I  shall 
consider  R  instead  of  T  from  now  on.) 

The  first  idea  is  that,  contrary  to  the  smoothing  effect  of  random  processes,  using  Baire’s 
theorem  accentuates  the  wildest  behavior  of  Fourier  transforms:  Probability  provides  M  sets, 
and  even  Ma  sets,  as  we  have  seen;  Baire  provides  sets  of  uniqueness.  Actually,  Baire’s 
theorem  is  a  very  good  tool  for  obtaining  Kronecker  sets  [22]  or,  in  several  dimensions, 
Helson  curves  and  surfaces.  That  at  least  was  my  personal  philosophy  until  1993. 

Then  T.  Komer  discovered  that  his  Helson-M  sets  (so  strange!)  and  also  Salem  sets  (so 
far  from  Kronecker!)  are  also  generic  if  suitable  Baire  spaces  are  chosen  [42].  His  ideas  for 
the  constmction  of  Salem  sets  are  expounded  in  [28].  Probability  does  not  disappear,  but  it  is 
reduced  to  a  technicality. 

In  [42],  Komer  called  Baire’s  theorem  “a  profound  triviality.”  Its  proof  is  trivial,  and 
its  use  by  Kaufman  and  Komer  is  profound.  The  use  of  probability  methods  relies  on  long 
experience  in  analysis  and  on  the  stochastic  processes  that  we  have  at  hand.  To  apply  Baire’s 
theorem,  the  first  point  is  to  discover  the  right  Baire  space;  the  second  step  is  to  chose  the  right 
open  sets.  It  is  a  good  way  both  to  render  strange  objects  generic  and  to  find  new  phenomena. 

I  wish  to  thank  Robert  Ryan  for  a  careful  reading  of  this  paper. 
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Abstract.  Gabor  theory  is  concerned  with  expanding  signals  /  as  linear  combina¬ 
tions  of  elementary  signals  that  are  obtained  from  a  single  function  g  (the  window) 
by  shifting  it  in  time  and  frequency  over  integer  multiples  of  a  time  shift  parameter  a 
and  a  frequency  shift  parameter  6.  In  these  expansion  problems  a  key  role  is  played 
by  the  Gabor  frame  operator  associated  with  the  set  of  elementary  signals  used  in  the 
expansions.  The  Gabor  frame  operator  determines  whether  stable  expansions  exist  for 
any  finite-energy  signal  /  (that  is,  whether  we  have  indeed  a  frame),  and,  if  so,  gives 
a  recipe  for  computing  the  expansion  coefficients  by  using  the  canonically  associated 
dual  frame.  In  this  contribution  we  consider  the  Gabor  frame  operator  and  associated 
dual  frames  in  the  time  domain,  the  frequency  domain,  the  time-frequency  domain, 
and,  for  rational  values  of  the  sampling  factor  the  Zak  transform  domain.  We 

thus  have  the  opportunity  to  address  the  basic  problems  -  whether  we  have  a  Gabor 
frame  and  how  we  can  compute  a  dual  frame  -  in  any  of  these  domains  we  find,  de¬ 
pending  on  g  and  a,  6,  convenient.  The  representations  in  the  time  domain  and  the 
frequency  domain  are  conveniently  discussed  in  the  more  general  context  of  shift- 
invariant  systems,  and  for  this  we  present  certain  parts  of  what  is  known  as  Ron-Shen 
theory,  adapted  to  our  needs  with  emphasis  on  computational  aspects. 

This  contribution  contains  many  examples,  counter-intuitive  and  confusing  re¬ 
sults,  statements  that  one  would  like  to  be  true  but  that  are  not  and  vice  versa,  etc.,  to 
show  that  Gabor  theory,  despite  its  rapid  development  in  the  last  ten  years,  is  still  far 
from  being  completed. 


I.  Introduction 

To  introduce  the  subject  matter  of  this  contribution  conveniently,  we  start  with  a  survey  of 
the  various  origins  and  early  developments  of  Gabor  theory.  We  do  not  aim  here  at  a  complete 
historical  account,  certainly  not  for  the  later  developments,  but  rather  refer  for  this  to  some 
excellent  recent  and  less  recent  papers  and  books  containing  such  surveys,  Gabor  systems 
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are  systems  of  functions  of  a  real  variable  t  that  are  built  from  a  single  function  g  (called  the 
window)  by  shifting  it  in  time  and  frequency  over  integer  multiples  of  a  time  shift  parameter 
a  and  a  frequency  shift  parameter  b.  That  is,  denoting  for  real  x,  y  by  the  time-frequency 
shifted  version 

(1.1)  = 

of  g,  a  Gabor  system  with  shift  parameters  a,  b  and  window  g  consists  of  the  functions 
9na,mb  with  integer  n,  m.  We  denote  this  system  by  {g,  a,  b).  These  systems  were  considered 
by  Gabor  [1]  in  1946,  with  the  window  g  a  Gaussian  and  (ab)~'^  ~  1,  with  the  aim  of 
constructing  efficient,  time-frequency  localized,  non-redundant  expansions  of  finite-energy 
signals  as  linear  combinations  of  the  system’s  elements  in  which  the  coefficients  “represent” 
the  expanded  signal.  Gabor’s  choice  of  Gaussian  elementary  building  blocks  and  densities 
with  product  equal  to  unity  was  motivated  by  his  desire  for  non-redundant,  unique 
expansions  that  should  exist  for  any  finite-energy  signal.  Indeed,  Gaussians  uniquely  achieve 
equality  in  the  uncertainty  inequality  At-  Af  ^  1/2  (with  the  deltas  referring  to  the  standard 
deviations  in  the  time  and  the  frequency  domain,  respectively),  whence  they  occupy  in  a  sense 
the  least  amount  of  area  in  the  time-frequency  plane.  Furthermore,  the  setting  of  Nyquist’s 
theorem,  saying  that  band-limited  signals  are  uniquely  determined  by  their  sample  values  at 
regularly  spaced  sample  points  with  the  spacing  determined  by  the  bandwidth,  can  be  recast 
into  a  limiting  case  of  a  Gabor  expansion  problem,  and  this  suggests  to  take  =  1  as 

critical  density  when  non-redundant  expansions  have  to  exist  for  all  signals. 

Already  much  earlier,  in  1932,  systems  ig,a,b)  with  Gaussian  g  and  {ab)~^  =  1  were 
considered  by  von  Neumann  [2]  in  a  quantum  mechanical  context,  and,  apparently,  he  estab¬ 
lished  the  completeness  of  these  systems  in  L^.  For  that  reason  one  also  finds  in  the  literature 
the  name  von  Neumann  lattice  systems,  and  also  Weyl-Heisenberg  systems  to  emphasize  the 
underlying  continuous  Weyl-Heisenberg  group  of  translations  in  the  phase  plane,  for  what  we 
have  called  Gabor  systems. 

Gabor’s  1946  paper  certainly  did  not  go  unnoticed  by  the  engineering  community,  but  it 
was  not  until  1980  that  the  attention  to  Gabor  expansions  was  revived  through  the  work  of 
Portnoff  [3],  Bastiaans  [4]  and  Janssen  [5].  This  revival  coincided,  not  entirely  by  accident, 
with  the  increasing  interest  in  the  electrical  engineering  community  in  time-frequency  tools, 
such  as  the  Wigner-Ville  distribution  and  the  short-time  Fourier  transform.  (It  should,  how¬ 
ever,  be  noted  that  as  early  as  1961  Lemer  [6]  presented  a  theory  of  signal  representations 
in  which  Gabor  expansions  play  a  dominant  role;  in  [6]  one  can  already  find  orthogonaliza- 
tion  procedures  reminiscent  of  what  we  presently  would  call  the  construction  of  canonically 
associated  dual  frames.)  From  the  von  Neumann  lattice  side,  completeness  results  for  the 
Gaussian  window  and  (a6)~^  =  1  were  already  obtained  by  Perelomov  [7]  and  Bargmann  et 
al.  [8]  using  the  Bargmann  transform  (and,  in  [8],  the  Zak  transform  in  disguised  form)  and 
by  Bacry,  Grossmann  and  Zak  [9]  using  the  Zak  transform. 
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Although  Perelomov,  in  [7],  already  presented  some  considerations  on  dual  functions,  it 
was  Bastiaans  in  [4]  who  analytically  computed  a  dual  function  for  the  case  of  Gaussian  g  and 
{ah)~^  =  1.  These  dual  functions  are  important  since  they  allow  one  to  exhibit  the  expansion 
coefficients  for  a  particular  signal  /  as  inner  products  of  /  with  the  dual  function  shifted 
in  a  similar  way  as  the  window  g  itself.  The  mathematical  analysis  given  by  Janssen  in  [5] 
and  [10]  of  the  convergence  properties  of  Gabor  expansions  and  of  Bastiaans’  dual  function 
showed  that  Gabor  systems  with  Gaussian  g  and  {ab)~^  =  1  yield  unstable  expansions  that  do 
not  properly  reflect  time-frequency  localization  of  the  signals  to  be  expanded.  This  point  was 
also  observed  by  Davis  and  Heller  in  1979  [11];  they  suggested  to  consider  Gabor  systems 
with  Gaussian  window  g  and  >  1  (oversampling)  and  thus  obtained  expansions  with 

much  better  convergence  properties. 

The  interest  of  mathematicians  in  Gabor  systems  dates  from  around  1980  with  Janssen’s 
work  [5],  [10]  on  the  connection  between  the  Bargmann  transform,  Zak  transform  and  Gabor 
expansions,  and  that  of  Feichtinger  (joined  later  on  by  Grdchenig)  focusing  on  the  more 
functional  analytic  (modulation  spaces)  and  group  theoretic  aspects  of  Gabor  expansions  [12], 
[13].  A  major  development  in  the  mathematical  theory  of  Gabor  expansions  was  made  in 
1986  by  Daubechies,  Grossmann  and  Meyer  [14]  who  placed  the  Gabor  expansion  problem 
in  the  context  of  frames  for  a  Hilbert  space.  The  latter  concept  was  introduced  by  Duffin  and 
Schaeffer  [15]  in  1951  for  addressing  completeness  and  expansion  problems  involving  sets 
of  exponentials  in  spaces  of  band-limited  functions.  For  a  Gabor  system  (5,  a,  h)  one  thus 
considers  the  frame  operator  5,  defined  for  /  6  by 

(1.2)  Sf  —  ^  ^  (/)  9na,mb)  9na,mb  * 

n,m 


By  definition,  the  Gabor  system  {g,  a,  b)  is  a  frame  when  the  frame  operator  S  is  bounded 
and  positive  definite.  In  this  case,  the  Gabor  system  (°7,  a,  b)  with 

(1.3)  °7  =  S-^g 

is  also  a  frame,  called  the  canonical  dual  frame,  and  for  any  f  e  L‘^  we,  have  the  L^- 
convergent  expansions 


(1.4) 


/  —  ^  ^  (/j  fna,mb)  9na,mb  —  (/?  9na,mb) 


fna^mb 


Many  of  the  research  efforts  in  Gabor  theory  after  1986  were  directed  at  studying  frame 
operators,  finding  criteria  for  when  a  Gabor  system  is  a  frame,  identification  of  the  tight  Gabor 
frames  (for  which  g  and  °7  coincide,  except  for  a  factor),  and  how  to  efficiently  compute  the 
canonical  dual  and  the  expansion  coefficients  in  (1.4).  These  problems  and  their  solutions 
have  attracted  many  scientists  from  quite  diverse  disciplines  and  fields,  such  as  theoretical 
electrical  engineering,  mathematical  physics,  Fourier  analysis,  numerical  analysis,  complex 
function  theory,  functional  analysis,  where  it  should  be  noted  that  especially  the  last  field  has 
increased  its  share  of  practitioners  considerably  over  the  last  few  years. 
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We  have  now  arrived  at  a  point  where  we  are  able  to  describe  the  technical  content  of  this 
contribution,  and  so  we  finish  the  historical  survey  by  pointing  at  two  basic  references  for 
the  developments  in  Gabor  theory  (before  and)  after  1986.  These  are  Ch.  4  of  the  book  [16] 
by  Daubechies  (having  an  enormous  influence  on  the  more  recent  developments  in  time- 
frequency  analysis,  and,  in  particular,  Gabor  theory)  and  the  book  [17],  edited  by  Feichtinger 
and  Strohmer,  which  is  entirely  devoted  to  Gabor  analysis  and  applications  with  an  extensive 
and  up-to-date  bibliography.  For  a  survey  of  Gabor  theory  until  1989  and  an  excellent  tutorial 
for  both  Gabor  theory  and  wavelet  theory,  one  should  also  consult  the  survey  paper  [18]  by 
Heil  and  Walnut. 

In  this  contribution  we  focus  on  the  various  representations  of  the  frame  operator  S 
in  (1.2).  To  that  end,  we  first  present  the  basics  of  frame  theory,  specialized  to  Gabor  systems 
and  to  the  more  general  shift-invariant  systems.  For  the  latter  type  of  systems  we  provide 
our  version  of  certain  parts  of  a  theory  developed  by  Ron  and  Shen  [19],  [20]  where  we  pay 
special  attention  to  the  issue  of  how  to  compute  canonical  dual  systems.  This  is  applied  in 
two  ways  to  Gabor  systems,  yielding  a  description  of  the  frame  bound  conditions,  the  Ga¬ 
bor  frame  operator,  the  duality  relation  (1.4)  and  a  characterization  of  and  a  computation 
method  for  the  canonical  dual  function,  both  in  the  frequency  domain  and  the  time  domain. 
The  representation  of  the  frame  operator  in  the  time  domain  is  well  known  as  Walnut’s  rep¬ 
resentation  [21].  We  shall  also  consider  Gabor  systems  in  the  time-frequency  domain  using 
spectrograms,  and  this  yields  the  Tolimieri-Orr- Janssen  representation  [22],  [23]  of  the  Gabor 
frame  operator  with  a  corresponding  description  of  the  frame  bound  conditions,  the  Wexler- 
Raz  biorthogonality  condition  [24]  for  the  duality  relation  (1.4),  and  a  characterization  of 
the  canonical  dual  function  as  the  minimum-energy  Wexler-Raz  dual.  For  rational  values  of 
the  sampling  factor  {ab)~^  we  can  also  consider  Gabor  systems  in  the  Zak  transform  domain 
which  yields  the  Zibulski-Zeevi  description  [25]  of  the  frame  bound  conditions,  frame  opera¬ 
tor,  duality  relation  and  characterization/computation  of  the  canonical  dual  function  in  terms 
of  Zak  matrices.  Each  of  the  four  representations  just  given  is  potentially  useful  as  a  tool  for 
finding  out  whether  a  Gabor  system  (g,  o,  b)  is  indeed  a  frame,  and,  if  so,  offers  a  means  for 
computation  of  (canonical)  dual  functions  in  the  considered  domain. 

We  conclude  this  contribution  with  various  counter-intuitive  and  confusing  results,  state¬ 
ments  that  one  would  obviously  like  to  be  true  but  that  are  not  and  vice  versa,  comments 
on  the  basic  and  hard  problem  of  when  a  particular  triple  (^,  a,  6)  is  a  Gabor  frame,  etc.  As 
examples  of  this  we  have  the  Balian-Low  theorem  (conflicting  with  the  relaxed  attitude  of 
von  Neumann,  Gabor  himself  and  Lemer  towards  completeness,  existence  and  convergence 
issues  for  the  Gaussian  window  Gabor  system  at  critical  density)  and  the  beating  of  this  same 
Balian-Low  phenomenon  by  considering  Wilson  systems  at  critical  density;  the  existence  of 
a  well-behaved,  positive  g  with  positive  Fourier  transform  such  that  (5-,  1)  is  not  a  Gabor 

frame;  the  difficulties  of  deciding  whether  (5,0,6)  is  a  frame  with  06  <  1  for  windows  g 
as  elementary  as  a  Gaussian  or  the  characteristic  function  of  an  interval.  All  this  shows  that 
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Gabor  theory,  despite  the  great  progress  that  has  been  made  in  recent  years,  is  still  far  from 
being  completed,  with  various  basic  questions  still  waiting  to  be  answered. 

Almost  all  results  presented  here  are  proved  somewhere  in  the  literature;  we  shall  there¬ 
fore  omit  all  proofs  and  we  shall  give  appropriate  references  instead.  In  Secs.  2-6  of  this 
contribution  we  follow  roughly  the  developments  of  Secs.  1.1-5  of  [17],  Ch.  1;  however,  the 
presentation  of  the  results  has  been  considerably  enhanced  by  adopting  a  uniform  organiza¬ 
tion  per  section,  while  some  of  the  results  have  been  worked  out  in  more  detail. 

2.  Basics  from  frame  theory 

In  this  section  we  present  some  basic  facts  from  frame  theory,  with  particular  attention  for 
Gabor  systems  and  shift-invariant  systems.  A  shift-invariant  system  consists  of  a  collection 
of  functions  gnm^  n,  m  G  Z,  of  the  form 

(2.1)  Pnm(^)  ~  9m{^  ^  €  M.  , 

where  E  e  Z,  and  a  >  0.  We  are  interested  in  finding  dual  systems  7„^,  n,  m  G  Z, 
with  7nm(^)  =  7m(^  ~  i  G  M,  by  which  we  mean  that  any  f  E  L‘^  has  the  -convergent 
expansions 

(2.2)  /  =  ^  ^  (/}  7nm)  9nm  —  ^  ^  {fi9nm)  7nm  • 

n,m  n,m 

For  this  to  be  meaningful,  we  require  the  two  systems  to  have  a  finite  frame  upper  bound.  A 
system  5nm»  n,  m  G  Z,  as  in  (2.1),  has  a  finite  frame  upper  bound  when  there  is  a  <  oo 
such  that 

(2.3)  /6L^ 

n,7n 

and  any  Bg  <  oo  such  that  (2.3)  holds,  is  called  a  frame  upper  bound. 

When  gnm,  n^mE'L,  has  a  finite  frame  upper  bound  Bg,  one  can  define  the  operators  Tg 
(analysis  operator)  and  T*  (synthesis  operator)  by 

(2.4)  Tg:fEL‘^-^Tgf  =  ((/,  9nm))n,m^Z  e  l\l?) 
and 

(2.5)  T*  :  a  E  (^{1?)  T* a  =  OLnm9nm  ^  > 

n,m 

1 

respectively.  These  Tg  and  T*  are  bounded  linear  operators  with  operator  norm  ^  B^ ,  and 
they  are  indeed  adjoint  operators  when  the  standard  inner  products  for  and  Z^(Z^)  are 
taken.  When  the  system  n,m  eIj,  has  a  finite  frame  upper  bound  as  well,  the  duality 
condition  (2.2)  can  be  written  as 

(2.6) 


T;T,  =  T;r,-/, 
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where  I  denotes  the  identity  operator  of 

When  the  system  n,m  E  Z,  has  a  finite  frame  upper  bound  Bg,  the  frame  operator 
Sg  is  defined  by  Sg  —  T*Tg.  Explicitly, 

(2.7)  Sg  :  f  E  L‘^  Sgf  —  ^  (/,  5nm)  9nm  ^  ) 

n,m 

and  there  holds  Sg  <  Bgl.  When  there  is,  in  addition,  an  >  0  such  that 

(2.8) 

n,m 

SO  that  Sg  is  invertible  with  Sg  ^  Agl,  we  say  that  the  system  n,m  E  Z,  has  a  positive 
frame  lower  bound,  and  any  Ag  >  0  such  that  (2.8)  holds  is  called  a  frame  lower  bound.  A 
system  gnm^  n^m  E  Z,  having  both  a  finite  frame  upper  bound  and  a  positive  frame  lower 
bound  is  called  a  frame.  When  we  have  Ag  —  Bg  in  (2.3)  and  (2.8),  we  say  that  the  frame  is 
tight,  and  then  we  have  Sg  =  Agl  —  Bgl. 

When  the  system  g^rn^  n,  m  G  Z,  is  a  frame,  a  dual  system  is  given  by 

(2.9)  'Ynm  —  Sg  gnm  i  n,  772  €  Z  , 

and  this  system  is  also  a  frame  with  frame  bounds  Ao^  =  B~^,  B^  —  Ag^.  Since  Sg,  and 
hence  commutes  with  all  relevant  time-shift  operators  f  E  f{’  -  no)  E 

72  €  Z,  we  have  that 

(2.10)  7nm  ~  Sg  gnm  —  {Sg  5m) ('  “  72(2)  ,  71,771  E  Z  . 

We  have,  furthermore,  that  SgS^  =  /,  whence  So^  is  the  inverse  of  the  frame  operator  Sg  and 
vice  versa.  The  system  in  (2.9)  is  called  the  canonical  dual  system. 

When  the  system  72,772  E  Z,  is  a  frame,  there  are,  in  general,  other  dual  systems 
7nm»  71,771  E  Z,  than  the  canonical  dual  system  in  (2.9).  When  we  have  two  systems  5nm. 
72, 772  6  Z,  and  7^^,  72, 772  G  Z,  both  with  a  finite  frame  upper  bound,  such  that  the  duality 
condition  (2.2)  with  -convergence  for  all  f  E  L?-  holds,  then  both  systems  are  a  frame.  This 
can  be  a  useful  method  for  checking  whether  a  particular  system  gnm,  72, 772  €  Z,  is  indeed  a 
frame,  viz.  in  those  cases  that  one  can  easily  produce  a  dual  system  fnm,  71,771  eZ,  that  does 
not  need  to  be  the  canonical  dual  frame  in  (2.9). 

The  dual  system  in  (2.9)  is  special  for  several  reasons.  For  any  f  E  and  any  a  E  /^(Z^) 
with 

(2.11)  f  —  ^  ^  TXnmgnm  , 

n,m 

n,m 


there  holds 

(2.12) 


n,m 
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with  equality  if  and  only  if  anm  =  (/,  ""inm)  for  all  n,m  e  Z.  By  applying  this  to  the  trivial 
representation  for  n,m  gZ 

(2.13)  9nm  —  ^  ^  (pnm?  Tn'm')  Qn'm'  —  Qnm  ^  ^  0  ‘  Qn’m'  j 

n' ,m’  (n' ,Tn')/:(n,m) 

we  find  that 

(2.14)  |(flnm,°7r!m)|^  ^  KSnm,  Vm')P  <  1  • 

n',m' 

A  different  way  to  characterize  the  dual  system  in  (2.9)  is  as  follows.  Assume  that  a  e  P 
is  given  and  that  we  consider  the  anm  as  noisy/distorted  versions  of  the  numbers  (f,gnm)^ 
n^m  Gli,  of  some  f  G:  L?.  Then  an  estimate  of  /  can  be  obtained  by  minimizing 

(2.15)  ^(/)  =  E  i(/’5nm)-anmP. 

n,m 

When  the  system  n,  m  €  Z,  is  a  frame,  this  yields  for  /  the  unique  solution 

(2.16)  f  =  Sg  ^  0inm9nm^  ~  ^  ^  O^nm  Inm  • 

n,m  n,m 

In  particular,  when 

(2.17)  O^nm  ~  ^nno^mmo  j  Ti,  ?7l  G  Z 

with  some  no,  mo  6  Z  (the  deltas  denote  ICronecker’s  delta),  we  obtain  /  =  °7nomo-  For  more 
generalities  about  frames  and  shift-invariant  systems  we  refer  to  [16],  Sec.  3.2,  [26],  Sec.  I.C 
and  [19],  Sec.  1.3. 

A  particular  example  of  a  shift-invariant  system  arises  when  we  take  for  m  E  Z 

(2.18)  9m{t)  =  e^^^^^^9{t).teR, 

with  6  >  0  and  ^  E  L^.  It  is  customary  here  to  ignore  the  phase  factors  in  7nm,  given  by 
exp(—27rinmab)  for  n,m  e  Z,  when  studying  duality  questions,  since  these  vanish  anyway 
at  the  right-hand  sides  of  (2.2).  Thus  one  considers 

(2.19)  gnaMt)  =  9(t  -  na)  vs.  g{t  -  na) 

for  n,m  eZ,  and  one  arrives  at  a  Gabor  system  (^,  a,  b)  as  in  Sec.  1  with  window  g  and  shift 
parameters  a  >  0,  6  >  0. 

In  the  case  of  a  Gabor  frame  (g,  a,  6),  the  frame  operator  Sg  commutes  with  all  relevant 
time-frequency  shift  operators  f  e  exp(2nimb')  /(•  —  na)  E  n,  m  E  Z.  As  a 

consequence  we  have  then  that 

(2.20)  fnm  ~  Sg  gnm  ~  (  'i)na,7nb 

with  —  Sg^  9  the  canonical  dual  window. 
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3.  Shift-invariant  systems 

In  this  section  we  consider  shift-invariant  systems  gnm^  n,m  e  Z,  and  7nmi  n,  m  G  Z, 
and  we  present,  in  the  frequency  domain  an  equivalent  condition  for  a  shift-invariant  system 
to  have  a  finite  frame  upper  bound  and  to  be  a  frame,  a  representation  result  for  the  frame 
operators,  an  equivalent  condition  for  the  two  systems  to  be  dual,  and  a  characterization  of 
and  a  computation  method  for  the  canonical  dual  system.  Many  of  the  results  in  this  section 
can  be  found  in  [19]  by  Ron  and  Shen,  However,  the  presentation  of  the  results  we  give 
here  is  rather  different  from  the  one  in  [19],  and,  for  instance,  the  results  on  frame  operator 
representation  as  well  as  those  on  the  characterization  and  computation  of  canonical  dual 
systems  cannot  be  found  in  [19],  at  least  not  in  the  form  we  present  them  here.  For  full 
details  and  proofs  we  refer  to  [27],  Sec.  1.2. 

We  consider  =  L‘^{R)  with  the  standard  inner  product  and  norm 

oo 

(3.1)  (/,  h)=  I  f(t)  A*(«)  dt ;  Wff  =  (/,  f),  f,  he  L^. 

—  OO 

Furthermore,  we  denote  for  f  e  by  f  -  Tf  the  Fourier  transform  of  /,  given  as 

00 

(3.2)  f(u)  =  (Tf){u)  =  J  fit)  dt ,  a.e.  t/  €  M  . 

— OO 

With  /„  e  L\  m  £  define  the  “matrices” 

(3.3)  :=  -  k/a))kez,mez  ,  a.e.  u  eR, 

whose  “row”  consists  of  the  sample  values  gmii'  -  k/a),m  £  Z. 

Theorem  1.  The  system  gnm,  n,  m  €  Z,  has  a  finite  frame  upper  bound  Bg  if  and  only 
ifHg{u)  and  H*{u)  define  for  a.e,  u  eRa  bounded  linear  operator  off{Z)  with  operator 

norm  ^  (a  Bg)^.  In  particular,  there  then  holds  for  k^m  eZ 

(3.4)  ^  ^  ~  k/ (1)1  ^  d  Bg  ,  ^  ]  \9m{^  ~  k/a)  j  ^  a  Bg 

m  k 

fora.e.  u  gR. 

Theorem  2.  Let  0,  B  <oo.  Then  we  have 

(3.5)  A\\f\f  ^  Y,  l(/-5nm)P  <  BII/IP  ,  /  e 

n,m 


AI^-  HAv)  H'M  ^  BI ,  a.e.  e 


if  and  only  if 
(3.6) 
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where  the  I  in  (3,6)  denotes  the  identity  operator  ofP{1j), 

We  observe  that  Hg{iy)  H*(j/)  is  given  as  the  “matrix” 

(3.7)  H;(u)  =  -  k/a)  gl{<'  -  «/«))  ,  a.e.  ueR, 

m  ’ 

and  that  for  all  j,  A:,  /  €  Z  there  holds 

(3.8)  {H,{u  -  l/a)  H;{u  -  l/a))j,  =  {H,{y)  ,  a.e.  1/  6  R  . 

Hence  for  checking  (3.6)  it  is  sufficient  to  consider  in  an  interval  of  length  1/a. 

Also  note  that  the  system  n,  m  €  Z,  is  a  tight  frame  if  and  only  if  there  is  a  constant 
c  such  that  for  a.e.  i/  €  M 

(3.9)  ^  Qmiv  -  k/a)  g*^{v  -  l/a)  =  c6ki ,  kj  el. 

m 

Theorem  3.  Assume  that  the  system  gnm>  n,m  e  h,  has  a  finite  frame  upper  bound  Bg, 
and  let  f  e  L^.  Then  we  have,  with  Sg  the  frame  operator, 

(3.10)  Sgf{i/)  =  -  V)  dk{v)  f(i/  -  k/a) ,  a.e. !/  e  R  , 

“  k 

with  absolute  convergence  of  the  right-hand  series  for  a.e.  u  eU.  Here 

(3.11)  dt{u)  =  {Hg(u)H;{i^))ok 

=  ~  s  ®  >  fc  e  z . 

m 

Because  of  (3.8)  there  holds,  more  generally,  for  f  e  L?  and  a.e.  u  eR 

(3.12)  -  IH  =  -  E  e  2  • 

^  k 

This  gives  a  frame  operator  representation  in  the  Fourier  domain  in  terms  of  the  matrices 
Hg{i/)  where  L^{R)  is  identified  with  L^{[0,  l/a)  x  Z).  The  relation  (3.12)  can  be 
extended  as  follows.  Let  Ag  be  a  frame  lower  bound  for  the  system  g^j^,  n,  m  G  Z,  and  let  ip 
be  a  function  analytic  in  an  open  set  containing  the  closed  segment  [Ag,  Bg],  Then  we  have 
for  f  e  L’^  and  a.e.  i/  eR 

-  l/a)  =  E  (v(  J  Hgiu)  H;{u)))^^  }(u  -  k/a)  ,leZ. 


(3.13) 
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In  particular,  when  Ag  >  0,  so  that  n,  m  e  Z,  is  a  frame,  the  choice  ip{x)  =  x  ^  yields 
a  representation  of  the  inverse  of  the  frame  operator  Sg  according  to 

(3.14)  S^f{iy-l/a)  = 

=  ;ez, 

k 

for  f  G  L?  and  a.e.  G  M.  Specialization  of  (3.15)  to  the  case  f  =  gm^  where  m  G  Z,  and 
I  =  0  yields  for  a.e.  u  eM. 

(3.15)  °7m(t^)  =  =  E  -  k/a) . 

\  (X  / 

k 

Theorem  4,  Assume  that  the  systems  n,  m  G  Z,  and  fnm,  n,m  e  Z,  have  finite 
frame  upper  bounds.  Then  the  two  systems  are  dual  in  the  sense  of  (2.2)  if  and  only  if 

(3.16)  Hg{u)  H*{u)  =  H*{p)  =  al ,  a.e.  i/  G  R  , 

if  and  only  if 

(3.17)  =  f^lm{v-kla)g'^{u) 

m  m 

=  aSko  ,k  e  Z ,  a.e.  u  eR  . 

Note  that  (3.16)  says  that  a~^H*{u)  is  a  right-inverse  of  Hg(i/)  and  that  a-^H.y{u)  is  a 
left-inverse  of  H*{u)  for  a.e.  u  eR  The  next  theorem  shows  that  the  canonical  dual  system 
°7nm»  n,  m  G  Z,  is  special  in  the  sense  that  is  “the”  generalized  inverse  of  Hg{i/) 

for  a.e.  u  eR 

Theorem  5.  Assume  that  gnm>  n,meZ,  is  a  frame.  Then 

(3.18)  ,  a.e.  i/  e  E  , 
and 

(3.19)  (i  H^{v)  =  (i  H,{v)  ,  a.e. 

Theorem  5  can  be  made  more  explicit  for  the  purpose  of  calculating  the  canonical  dual 
functions  as  follows. 

Theorem  6.  Assume  that  the  system  g-am,  n,  m  G  Z,  is  a  frame,  and  denote  by  c{u)  G 
f{Z)for  a.e.  u  eR  the  least-norm  solution  c  =  (cm)mGZ  of  the  linear  system 

(3.20)  E  -  kja)  Cm  =  a5ko,  kel. 


Then  there  holds 

(3.21) 


°7mM  =  ,  m  €  Z ,  a.e.  i/  e  E  . 
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As  a  consequence  of  Theorem  6  we  have  the  following.  Assume  that  gnm,  n.me’L,  and 
7nm>  n,m  eZ,  are  dual  frames.  Then  we  have 

(3.22)  ^  rW')?  ^  E  ’  a-e-  ^  e  R  , 

m  m 

with  equality  if  and  only  if  °7^  =  7^  a.e. 

4.  Gabor  systems  as  shifMnvariant  systems 

In  this  section  we  specialize  the  results  of  Sec.  3  to  the  case  of  a  Gabor  system  (g,  a,  6), 
so  that  we  have  now  ag  e  L‘^  and  a  6  >  0  such  that 

(4.1)  grr^it)  =  ,  m  €  Z  ,  f  G  M  ; 

as  already  said  it  is  customary  in  Gabor  theory  to  consider  gna,mb  rather  than 
gnm  =  exp{2'Kimb{-  -  na))g{‘  -  na). 

This  amounts  to  dropping  the  phase  factors  exp{~27rinmab).  Since  {g,  a,  b)  is  a  Gabor  sys¬ 
tem  if  and  only  if  (§,  6,  a)  is  a  Gabor  system,  we  have  two  ways  of  specialization  of  the 
results  of  Sec.  3,  yielding  a  description  of  the  various  notions  and  conditions  in  the  frequency 
domain  and  the  time  domain,  respectively. 

4.1.  Frequency  domain  results 

We  first  show  what  form  the  results  of  Sec.  3  take  when  we  choose  the  gm  as  in  (4.1). 
This  then  yields  a  description  of  the  frame  bound  conditions,  the  frame  operator,  the  duality 
condition  and  a  characterization  of  and  a  computation  method  for  the  canonical  dual  function 
°7.  Since  now 

(4.2)  =  (:Fpm)(i/)  =  g(u  -  mb) ,  m  &  Z ,  a.e.  i/  e  R  , 
the  “matrix”  Hgiy)  in  (3.3)  is  given  by 

(4.3)  Hg(u)  =  (g{f-mb-k/a))ks7.,m&, 

a.e.  u  eR , 

and  the  “matrix”  Hg(u)  in  Theorem  2  is  given  by 

oo 

(4.4)  g(u  -  mb  -  k/a)  g*{i/ -  mb  -  l/a)\ 

a.e.  u  eR. 

Hence,  Theorem  1  gives  that  {g,  a,  b)  is  a  Gabor  system  with  a  finite  frame  upper  bound  Bg  if 
and  only  if  Hg{u)  in  (4.4)  and  H*{i/)  define  for  a.e.  u  eRa  bounded  linear  operator  of  P(Z) 
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with  operator  norm  <  (a  2 ;  in  particular,  we  then  have  that 

(4.5)  ^  \g{v  -  mb)f  ^aBg  ,  ^  {gii/  -  k/a)\‘^  ^aBg  ,  a.e.  i/  G  M  . 

m  k 

Furthermore,  from  Theorem  2  we  see  that  {g,  a,  6)  is  a  Gabor  frame  with  frame  bounds  ^  >  0, 
B  <  00  if  and  only  if  the  matrices  Hg{u)  in  (4.5)  satisfy 

(4.6)  AI  Hg{v>)  HUi/)  ^  BI ,  a.e.  u  eM  . 

a  ^ 

Moreover,  by  Theorem  3  the  frame  operator  of  the  Gabor  frame  {g,  a,  b)  has  the  representation 

(4.7)  f('^  -  */“)  >  a-e-  €  E  , 

“  k 

for  f  e  with  absolute  convergence  of  the  right-hand  side  series  for  a.e.  1/  €  M,  in  which 
the  dkiif)  are  given  by 

(4.8)  dk{v)  =  gil'  ~  mb)  g*{i/  -  mb  -  k/a)  ,  k  a.e.  1/  . 

m 

Also,  tightness  of  the  frame  (5,  a,  6)  is  equivalent  with 

(4.9)  ^  g{v  —  mb  —  k/a)  g*{u  —  mb  -  l/a)  =  cSki  ,  kj  elj 

m 

for  a.e.  i/  eR  with  c  some  constant.  And  duality  of  two  Gabor  systems  {g,  a,  b)  and  (7,  a,  b) 
having  a  finite  frame  upper  bound  is  equivalent  with 

Y2g{i'-mb-k/a)j*{iy-mb)  =  ^  -  mb  -  k/a)  g*(j/ -  mb) 

m  rn 

=  a6)i{) ,  ^ ,  a.e.  €  ]R  . 

Finally,  the  canonical  dual  functions  °7m  =  exp(27r2m6  )  °7  can  be  found  by  using  that 

(4.10)  ''7(1/)  =  Hg(v)  9{^  -  k/a)  ,  a.e.  i/  e  E  . 

k 

This  canonical  dual  function  °7  is  minimal  in  the  sense  that  for  any  dual  frame  {7,  a,  6)  we 
have  for  a.e.  i/  G  E  that 

(4.11)  ^  |°7(i/-m6)p  |7(i/-m6)|^ 

m  m 

with  equality  if  and  only  if  —  mb)  =  j(i/  —  mb),  m  G  Z.  When  we  integrate  (4.1 1)  over 
an  interval  of  length  b  and  use  Parseval’s  theorem,  we  obtain 
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with  equality  if  and  only  if  °7  =  7  a,e.  Hence  the  canonical  dual  °7  has  the  least  -norm 
among  all  dual  functions. 

4.2.  Time-domain  results 

We  next  show  how  the  results  of  Sec.  3  can  be  applied  to  yield  a  description  of  the  frame 
bound  conditions,  the  frame  operator,  the  condition  of  duality,  and  a  characterization  of  and 
a  computation  method  for  the  canonical  dual  “7  in  the  time  domain.  To  do  so,  we  note  that 

(4.13)  [9na,mb]  (v)  =  {v),n,m&Z,  a.e.  ueR. 

Here  denotes  the  inverse  Fourier  transform  and  g  is  the  inverse  Fourier  transform  of  g, 
so  that 

00 

(4.14)  g{u)  =  {J^-^g){u)  =  j  9{t)  dt ,  a.e.  €  E  . 

-C» 

Furthermore,  we  have  for  f  e 

(4.15)  {f^9na,Tnb)  —  ^  (/ j  {9)-mb,na)  j  71,771  G.  ^  . 

Consequently,  when  the  system  {g,  a,  b)  has  a  finite  frame  upper  bound  or  a  positive  frame 
lower  bound,  then,  by  Parseval’s  theorem,  so  has  the  system  {g,b,a),  and  the  respective 
frame  bounds  can  be  taken  equal.  And,  in  the  case  of  finite  frame  upper  bounds,  the  two 
frame  operators  are  related  according  to 

(4.16)  5(,,„,i)/  =  S^f  ,feLK 

Furthermore,  the  two  Gabor  systems  {g,  a,  b)  and  (7,  a,  b)  are  dual  if  and  only  if  the  systems 
{g,  6,  a)  and  (7, 6,  o)  are  dual.  Also,  when  (g,  a,  b)  is  a  Gabor  frame  with  canonical  dual 
Gabor  frame  (“7,  a,  b),  then  (g,  6,  a)  is  a  Gabor  frame  as  well  with  canonical  dual  ((°7);  6,  a) 
(for  the  latter  fact  we  have  used  (4.16)  together  with  {hy  =  h  for  he  L?). 

Accordingly,  we  consider  now  the  “matrix” 

(4.17)  Mg{t)  :=  {g{t  -na-  i/6))i€Z,nGZ  ,  a.e.  teR, 

instead  of  the  “matrix”  Hg{u)  in  (3.3)  and  (4.4),  and  in  Theorem  2  and  (4.5)  we  have  now  the 
“matrix” 

00 

(4.18)  Mg{t)Mg{t)  =  (  X]  9{t-na-i/b)9'‘{t-na-jfb/j_^^, 

n=— 00  ’ 

a.e.  t  eR , 

instead  of  the  matrix  Hg{v)  Theorem  1  gives  that  {g,  a,  b)  has  a  finite  frame  upper 

bound  Bg  if  and  only  if  Mg(t)  in  (4.17)  and  M*{t)  define  for  a.e.  t  e  Ro.  bounded  linear 
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operator  of  P  (Z)  with  operator  norm  ^  in  particular,  we  then  have  that 

(4.19)  ^  -  i/6)|'  ^bB,,  a.e.  i  e  R  . 

n  i 

Furthermore,  by  Theorem  2  we  see  that  {g,  a,  b)  is  a  frame  with  frame  bounds  >  0,  B  <  oo 
if  and  only  if 

(4.20)  i  Mg{t)  M;(t)  ^  BI ,  a.e.  <  e  R  . 

Next,  by  Theorem  3  the  frame  operator  of  the  Gabor  frame  (g,  a,  6)  has  the  representation 

(4.21)  iS(g,a,b)fW)  =  ^  ~  */**)  ’  *  e  K  , 

t 

for  f  e  L'^  with  absolute  convergence  of  the  right-hand  side  series  for  a.e.  t  e  E,  in  which 
the  Cj  (t)  are  given  by 

(4.22)  ei{t)  =  ^  g(t  -  na)  g*(t  -  na  -  i/b)  ,  i  eZ,  a.e.  t  gR  . 

n 

Also,  tightness  of  the  frame  {g,  a,  b)  is  equivalent  with 

(4.23)  '^g(t-na-  i/b)  g‘{t  -na-  j/b)  =  c  5ij  ,  i,j  eZ, 

n 

for  a.e.  t  G  R  with  c  some  constant.  Moreover,  the  duality  condition  between  two  Gabor 
frames  {g,  a,  b)  and  (7,  a,  b)  can  be  expressed  as 

g{t  -na-  i/b)  -  na)  -  f{t  -  na  -  i/b)  g*{t  -  na) 

n  n 

(4.24)  b  6iQ ,  i  €  Z  ,  a.e.  ^  G  E  . 

Finally,  the  canonical  dual  function  can  be  computed  as 

(4.25)  ’’yit)  =  '£{1  -  *■/(') .  a-e-  <  e  R  , 

i 

and  this  is  minimal  in  the  sense  that  for  any  other  dual  function  7  we  have  that  ||°7|p  ^ 
II7IP  with  equality  if  and  only  if  °7  —  7  a.e. 

We  note  that  the  representation  (4.21-4.22)  of  the  frame  operator  Sg  (with  shift  param¬ 
eters  a,  b)  is  called  the  Walnut  representation  [21]  of  the  frame  operator.  Note  that  this 
representation  holds  for  any  /  6  with  absolute  convergence  for  a.e.  t.  A  detailed  study 
of  the  convergence  of  the  right-hand  side  of  (4.21)  as  an  operator  of  has  been  carried  out 
in  [28].  A  sufficient  condition  that  the  Gabor  system  has  a  finite  frame  upper  bound  while  the 
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representation  (4.21)  converges  unconditionally  is  that  g  satisfies  the  CC-condition:  there  is 
an  M  <  oo  such  that 

(4.26)  ^  |ej(<)l  ^  M,  a.e.t  sR, 

i 

with  the  Ci  given  in  (4.22),  see  [28],  Theorem  4.1  and  6.9. 

5.  Gabor  systems  in  the  time-frequency  domain 

In  this  section  we  consider  Gabor  systems  {g,  o,  b)  with  g  £  L?  and  a  >  0,  ft  >  0  in 
the  time-frequency  domain.  We  shall  thus  obtain  a  description  in  the  time-frequency  domain 
of  the  frame  bound  conditions  and  the  frame  operator,  of  the  duality  condition  and  of  the 
canonical  dual  function  °7.  We  define  time-frequency  shift  operators  Uki  for  A;,  ^  G  Z  by 

(5.1)  Uki  h  =  hk/b,i/a  ,  he  . 

The  proofs  of  the  main  results  in  this  section  can  be  found  in  [23],  [27],  Sec.  1.4,  while  many 
of  these  main  results  can  also  be  found  in  [20],  [29].  It  should  be  noted  that  the  approaches 
used  in  [23],  [27],  Sec.  1.4  and  in  [20]  and  in  [29]  are  quite  different;  indeed,  [20],  [23] 
and  [29]  were  written  independently  of  one  another  and  more  or  less  simultaneously.  We 
follow  here  the  the  approach  in  [23],  [27],  Sec.  1.4  which  is  based  upon  what  we  call  the 
Fundamental  Identity,  This  identity  can  be  traced  back  to  the  work  of  Tolimieri  and  Orr  [22], 
and  the  sharp  form  that  we  present  below  is  due  to  Janssen,  [30],  Proof  of  Prop.  A,  [23], 
Props.  2.3  and  2.4,  [27],  Subsec.  1.4.1. 

Theorem  7  (Fundamental  Identity).  Let  and  assume  that  at 

least  one  of  the  systems  (  f^‘^\a,b)  and  at  least  one  of  the  systems  {f^^\a,b), 

a,  b)  has  a  finite  frame  upper  bound.  Also  assume  that 

(5.2) 

k,l 

Then 

(5.3)  ^  • 

n,m  k,l 

The  proof  of  this  result  consists  of  a  careful  inspection  of  the  proof  of  the  Poisson  sum¬ 
mation  formula  for  functions  of  two  variables  and  their  2D-Fourier  transforms  where  the 
2Z7-Fourier  transform  pair 

(5.4)  /*'>); 

{v,w) 


is  taken. 
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Now  let  g  G  and  define  the  linear  mapping  Ug  of  by 

^gf  =  ((fi9k/b,l/a))k,lel.  ,  f  ^  ‘ 

Theorem  8.  The  Gabor  system  {g,  a,  h)  has  a  finite  frame  upper  bound  Bg  if  and  only 
if  Ug  and  U*  are  bounded  linear  mappings  of  into  B  ij?)  and  B  (Z^)  into  L^,  respectively, 

with  operator  norms  ^  {abBg)i,  In  particular,  the  Gabor  system  {g,  1/6, 1/a)  then  has  the 
finite  frame  upper  bound  abBg, 

Note  that  the  mapping  U*  is  given  by 

(5-6)  =  ^  ^  • 

k,l 

Theorem  9.  Let  A  ^  0,  B  <  oo.  Then  we  have 

(5.7)  A II/IP  <  l(/-5na,m6)|^  <  B  ll/f  ,  /  G 

n,m 

if  and  only  if 

(5.8)  AI  ^  ^  BI , 

where  I  is  now  the  identity  operator  of  That  is,  (g,  a,  b)  is  a  frame  if  and  only  if 

(p,  1/a,  1/6)  is  a  Riesz  basis  for  its  linear  span. 

We  observe  that  Ug  U*  maps  /^(Z^)  into  B{Z^)  (when  (5.8)  holds),  with  matrix  elements 
given  by 

(5-9)  {Ug  U*)kj.k',i'  =  (gk>ib,i'/a,  gk/b,i/a)  ,  kj  eZ;  k\l’  gZ  . 

Hence,  the  frame  upper  bound  conditions  and  tightness  of  the  Gabor  frame  {g,  a,  b)  can  be 
read  off  from  the  operator  Ug  U*  whose  matrix  elements  are  given  in  (5.9).  In  particular, 
(^,  a,  6)  is  a  tight  frame  if  and  only  if 

(5-10)  {9k'/b,i'/a,gk/b,i/a)  =  cSkk>  ^iv  ,  k,l  eZ]  k'J'eZ, 

for  some  constant  c. 

We  next  give  a  result,  Theorem  10  below,  on  frame  operator  representation.  We  first 
introduce  a  norm-preserving  mapping  of  into  L^(Z^  x  [0, 6"^)  x  [0,  a"^)).  Let  h  be  any 
member  of  Schwartz  space  S  with  \\h\\  =  1.  Then  the  mapping  STFTh,  defined  for  f  G  L? 
by 

(5.11)  {STFTkf){x,y)  =  {f,h,^y),y^R, 

is  a  norm-preserving  mapping  of  into  L'^{R^).  Now  there  holds,  see  (5.1),  for  /  e  L^ 
k,l  eZ  and  x,y  eR  that 

(5.12)  (U,J,  h,,y)  =  if,  . 
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Hence  the  mapping  I4,  defined  for  /  6  by 

(5.13)  {VHf){k,l-,x,y)  =  k,leZ; 

X  6  [0,6"^),  y  e  [0,a"')  , 
is  a  norm-preserving  mapping  from  into  x  [0, 6“^)  x  [0,  a“^)). 

Theorem  10.  When  the  system  {g,  a,  b)  has  a  finite  frame  upper  bound  Bg,  we  have  for 

{V^s,f){;  ■■,x,y)  =  i  (U,u;f  (Vnf)i;  ■■,x,y), 

(5.14)  x6[0,6-'),j/e[0,a-'), 
where  {UgUgY  is  the  transpose  of  the  ‘"matrix”  UgU*  in  (5.9). 

Note  that  in  this  representation  of  the  frame  operator  Sg  the  matrix  {ab)~'^  {UgU*)'^  is 
independent  of  x,y.  The  relation  (5.14)  extends  as  follows.  Assume  that  Ag  is  a  frame  lower 
bound  of  the  Gabor  frame  (^,  a,  6)  and  that  ip  is  analytic  on  an  open  set  containing  the  closed 
segment  [Ag^Bg].  Then  (5.14)  holds  with  Sg  at  the  left-hand  side  replaced  by  ip{Sg)  and 
(ab)~^  {UgUg)'^  at  the  right-hand  side  replaced  by  ip({ab)~^  (Ug^gV)- 

The  representation  in  Theorem  10  can  be  rephrased  more  loosely  as  follows.  Assume  that 
the  Gabor  system  (p,  a,  b)  has  a  finite  frame  upper  bound  Bg.  Then  the  frame  operator  Sg  has 
the  representation 

(5.15)  *^3  “  ^  X]  9k/b,l/a)  Ukl 

k,l 

in  the  sense  that  for  any  f^h  £  such  that  {{Ukif,  h))k,iez  €  /^(Z^)  there  holds 

(5.16)  {SJ,  i9,9k/»ia){Ukif,  h) . 

k,l 

In  the  case  that  g  satisfies 

(5.17)  condition  A  :  E-.=  y2  l(P>5w/o)l  <  0° 

k,l 

of  Tolimieri  and  Orr  [22],  Sec.  3,  the  system  (p,  a,  b)  has  the  finite  frame  upper  bound  E/ab, 
and  the  convergence  in  (5.15)  is  without  any  proviso. 

Theorem  11.  Assume  that  the  systems  (5,0,6)  and  (7,0,6)  have  finite  frame  upper 
bounds.  Then  the  two  systems  are  dual  if  and  only  if 

(5.18)  UgU;  =  U^U*g  =  abl . 

Moreover,  we  have  for  the  canonical  dual  °j  that 

(5.19)  ut,  =  cd)U;(u,u;)-\ 
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The  duality  condition  in  (5.18)  can  be  made  more  explicit  as  follows.  We  have  that  two 
Gabor  systems  (^,  a,  b)  and  (7,  a,  6),  both  with  a  finite  frame  upper  bound,  are  dual  if  and 
only  if 

(5.22)  {l^9k/b,i(a)  =  O'bdko^iQ  ,  k,l  eZ  . 

This  is  a  rigorous  form  of  the  celebrated  Wexler-Raz  biorthogonality  condition  [24].  Also, 
(5.19)  can  be  made  more  explicit  as  follows.  We  have  that  is  the  unique  element  7  G  of 
minimum  norm  such  that  (5.22)  holds.  The  latter  result  has  become  known  as  the  “Wexler- 
Raz  dual  equals  the  frame  duaf’-result. 


6.  Gabor  systems  in  the  Zak  transform  domain 

We  consider  in  this  section  Gabor  systems  (p,  a,  b)  for  the  special  case  that  {ab)~^  ~  q/p 
with  integer  q  and  p  satisfying  gcd{q,  p)  =  1. 

Let  A  >  0.  We  define  for  h  e  L?  the  Zak  transform  Zxhoih  by 

00 

(6.1)  {Zxh)[t,  v)=\2  -  k))  ,  a.e.  <,  1/  e  R  . 

k~-oo 

The  following  properties  hold  for  the  Zak  transform.  Let  /, /i  G  Then  Zf,Zh  G 
quasi-periodic  according  to 

(6.2)  F{t  -\-\^p)  =  F{t,  u) ;  F(t,  v  -\-l)  =  F{t,  v) ,  a.e.  t,  G  M  , 
and  there  holds 

(6.3)  (/,/i)  =  (Z/,Z/i), 

where  the  inner  product  on  the  right-hand  side  involves  any  unit  square  in  R^.  Furthermore, 
any  F  G  satisfying  (6.2)  is  of  the  form  F  —  Zf  with  some  unique  f  e  Some  other 
properties  are 

A^  /(At)  =  J  (Zxf){t,  u) ,  a.e.  t  G  R  , 

0 


(6.4) 
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and 

(6.5)  I/)  =  t) ,  a.e.  t,ueR. 

Finally,  any  continuous  F  satisfying  (6.2)  has  a  zero  in  any  unit  square  in  See  [31]  for 
these  and  many  more  properties  of  the  Zak  transform. 

The  usefulness  of  the  Zak  transform  for  description  of  frame  bound  conditions,  frame 
operator,  the  duality  condition  and  characterization  and  computation  of  the  canonical  dual 
function  was  recognized  and  elaborated  by  Zibulski  and  Zeevi,  see,  for  instance,  [25].  Also 
see  [26],  pp.  978  and  981  and  [32].  We  make  the  choice  A  =  b~^  and  suppress  the  subscript 
A  in  Zx  so  that 

(6.6)  iZh)  (t, !/)  =  6-  ?  y]  ’  a-e-  e  »  . 

k~~oo 

for  h  e  See  [27],  Subsec.  1.5.7,  where  it  is  shown  that  the  choice  A  =  a  yields  equally 
useful  results. 

We  set  for  f,h  e  L‘^  and  a.e.  G  R 

(6.7)  ^f(t,u)=p-i((Zf){t-l^^,u  +  ^))  . 

( =  0, . . .  -  1 

(6.8)  Af'it,  u)  =  I/))* . 

Theorem  12.  The  Gabor  system  {g,  a,  b)  has  a  finite  frame  upper  bound  Bg  if  and  only 
if^^{t,  v)  and  i'))*  are  for  a.e.  t,v  &  [0, 1)  bounded  linear  mappings  ofO  into  O’ 

I 

and  CP  into  CP,  respectively,  with  norm  ^  Bg.  In  particular,  there  holds 

(6.9)  I  {Zg)  (t,  i/)\^  ^pBg  ,  a.e.  eR. 

We  note  here  that  it  thus  follows  that  {g,  a,  b)  has  a  finite  frame  upper  bound  if  and  only 
if  Zg  is  essentially  bounded. 

Theorem  13.  Let  A  ^  0,  B  <  oo.  Then  we  have 

(6.10)  ^ll/lP^El(•^'W6)l^<-B||/||^/€i^ 

n,m 

if  and  only  if 

(6.11)  Alpyp  <  <  BIpxp  ,  a.e.  t,i/  eR, 

where  Ipxp  denotes  the  identity  operator  o/O’. 
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We  observe  that  one  can  restrict  oneself  in  checking  the  condition  (6.11)  to  the  set  {t,  v)  6 
[0,  q~^)  X  We  also  note  that  the  Gabor  frame  (p,  a,  b)  is  tight  if  and  only  if 

(6.12)  =  clp^p  ,  a.e.  eR, 
for  some  constant  c. 

Theorem  14.  Assume  that  the  Gabor  system  {g,  a,  b)  has  a  finite  frame  upper  bound. 
With  Sg  the  frame  operator,  there  holds  for  f  €  L'^ 

(6.13)  u)  =  u)  ^^{t,  v)  ,  a.e.  f,  i/  €  H  . 

Theorem  14  gives  the  representation  of  the  frame  operator  Sg  in  the  Zak  transform  domain 
via  the  matrices  in  (6.7).  More  generally,  when  Ag  is  a  frame  lower  bound  for  (g,  a,  b) 
and  ip  is  analytic  in  an  open  set  containing  the  closed  segment  [Ag,  Bg],  we  can  replace  Sg 
at  the  left-hand  side  of  (6.13)  by  ip{Sg)  and  A^^(t,u)  at  the  right-hand  side  of  (kl3)  by 

t/)). 

Theorem  15.  Assume  that  the  systems  (p,a,  6)  and  (7,  a,  6)  have  finite  frame  upper 
bounds.  Then  the  two  systems  are  dual  if  and  only  if 

(6.14)  u)y  -  u)y  =  Ip^p  ,  a.e.  fveR. 

Moreover,  we  have  for  the  canonical  dual  that 

(6.15)  u)y  =  u)r  ($’(<,  u)  (4><'(t,  J/))*)-'  ,  a.e.  i,  ^  e  K  , 
and 

(6.16)  AL'^°'^[t,  v)  =  1^))“^  ,  a.e.  eR  . 

The  condition  of  duality  can  be  written  more  explicitly  as  follows.  The  systems  {g,  a,  b) 
and  (7,  a,  6),  both  having  a  finite  frame  upper  bound,  are  dual  if  and  only  if  for  a.e.  £R 
we  have 

(6.17)  -  ^  {Zg)(t  -  I  -  +  ~^{Z'yy(t  -  I  -  =  Sko ,  k  =  0,  ...,p-  1  . 

P  Q  P  Q 

Furthermore,  for  any  dual  system  (7,  a,  b)  and  a.e.  f ,  1/  e  R  we  have 

(6.18)  +  ^  + 7’'')!  ’ 

k=0  ^  fc=0  ^ 

with  equality  if  and  only  if  (Z°j)(t  k/q,  u)  {Z'y)[t  -1-  fc/g,  i/)  for  A:  =  0, ^  -  1. 
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7.  When  is  {g^  a,  b)  a  Gabor  frame? 

In  this  section  we  present  a  collection  of  results,  comments,  observations,  (counter)ex- 
amples,  open  problems,  etc.,  on  the  basic  problem  of  deciding  whether  a  triple  {g,  a,  b)  with 
g  £  L?  and  a  >  0,6  >  Oisa  frame.  While  the  finite  frame  upper  bound  condition  is 
reasonably  easy  to  deal  with  by  imposing  rather  mild  smoothness  and  decay  conditions,  the 
positivity  of  the  frame  lower  bound  presents  a  much  harder  problem.  The  basic  problem  can 
be  considered  in  each  of  the  four  domains  of  Secs.  4~6,  and  any  one  of  these  domains  can 
come  with  a  particular  advantage.  By  nature,  this  section  is  not  as  well  organized  as  the 
preceding  sections.  Other  authors  might  well  have  chosen  to  include  different  specific  topics. 

7.1. 

We  start  with  the  well-known  result  that  when  g  e  L'^  and  (^f,  a,  6)  is  a  frame  then  we 
must  have  (a6)"^  ^  1.  This  result  has  a  long  and  complicated  history,  see  for  this  [16], 
Sec.  4.1,  [26],  p.  978,  [30],  Sec.  1.  There  is  a  stronger  result,  due  to  Howe  and  Steger  using 
results  of  Rieffel  in  [33],  that  says  that  completeness  of  the  system  {g,  o,  6)  in  implies  that 
(a6)~^  ^  1.  In  [34],  Sec.  2,  Benedetto,  Heil  and  Walnut  present  a  somewhat  unsettling  exam¬ 
ple  ofageL'^  and  an  irregular  Gabor  system  (i.e.  the  time-frequency  points  involved  in  the 
shifts  do  not  form  a  lattice)  of  arbitrarily  low  density  such  that  the  system  is  complete  in  L^, 
but  not  a  frame.  Restricting  again  to  frames,  several  proofs  of  the  fact  that  (a6)“^  ^  1  when 
(5,  a,  6)  is  a  frame  are  known  now.  When  (ab)"^  =  q/p  is  rational  with  integer  q,  p  such  that 
gcd(^,  p)  =  1,  a  simple  rank  consideration  of  the  matrices  in  (6.7)  and  (6.1 1)  suffices  to  show 
thatp  ^  q,  i.e.  (ab)~^  ^  1.  For  general  values  of  (tt6)~^  a  simple  proof  can  be  based  upon 
the  Wexler-Raz  biorthogonality  condition  (5.22)  together  with  the  inequality  |  (p,  °7)  1  ^  1  that 
follows  from  (2.14).  Yet  another  proof  follows  upon  integrating  the  duality  condition  (4.10) 
for  the  canonical  dual  °j  over  an  interval  of  length  6  and  using  (as  in  the  previous  proof)  that 
I  (5/7)  I  ^  1-  It  is  the  latter  approach  that  can  be  generalized  to  shift-invariant  systems,  the 
windows  of  which  have  certain  frequency  localization  properties,  see  [35]. 

7.2. 

We  now  consider  the  case  that  (ab)~^  —  1.  It  is  particularly  convenient  to  discuss  this 
case  in  the  Zak  transform  domain  since  now  we  have  p  =  q  =  lm  {ab)~^  —  q/p,  so  that  the 
matrices  in  (6.7-6.8)  reduce  to  scalars.  As  already  said  in  Sec.  6,  when  Zg  is  continuous  it 
must  have  a  zero  in  any  unit  square.  Hence,  when  ab  =  I  and  (g,  a,  6)  is  a  frame,  it  cannot 
be  true  that  g  is  continuous  and  rapidly  decaying  (for  then  Zg  is  continuous  and  has  a  zero, 
whence  the  lower  frame  bound  in  (6.11)  is  zero).  Another  result  for  the  case  (a6)“^  =  1 
is  the  Balian-Low  theorem  (for  the  complicated  history  of  this  result,  see  [34],  Subsec.  1.1), 
according  to  which  at  least  one  of  g*{t)  and  tg{t)  is  not  square  integrable  as  a  function  of 
f  G  M  when  (p,  a,6)  is  a  frame.  Somewhat  surprisingly,  there  is  a  construction  involving 
cosines  and  sines  rather  than  exponentials  where  one  does  get  a  frame  at  critical  density 
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{ab)~^  =  1  with  a  well-behaved  window  g  (Wilson  bases,  see  [16],  Subsec.  4.2.2  for  history 
and  details  of  the  construction). 

In  the  remainder  of  this  section  we  shall  consider,  with  few  exceptions,  the  case  that 
(a6)“^  >  1. 

7.3. 


We  shall  first  indicate  a  class  of  windows  g  such  that  [g,  a,  b)  is  a  Gabor  frame.  To  that 
end  we  choose  a  continuous  g,  positive  on  and  vanishing  outside  an  interval  (-|c,  ^c),  where 
c  is  any  number  in  the  non-empty  interval  (a,  6“^).  Now  the  “matrix”  Mg{t)  M*{g)  in  (4.19) 
is  a  diagonal  matrix  with  strictly  positive  diagonal  elements 

(7.1)  D(i)  =  ^  |a(i -no)|^  ,  <  e  R  , 

n 

that  are  bounded  away  from  0  and  oo.  Hence  by  (4.20)  there  are  the  frame  bounds  b~^  min  D, 
b~^  max  D.  One  easily  sees,  furthermore,  that  a  tight  frame  (h,  a,  b)  is  obtained  by  choosing 
h  =  glDi. 

7.4. 


There  are  a  few  cases  where  one  can  show  that  a  system  (g^  a,  b)  with  a  finite  frame  upper 
bound  is  a  Gabor  frame  by  displaying  a  dual  function  7  such  that  {7,  a,  b)  has  a  finite  frame 
upper  bound.  This  is  so,  for  instance,  for  the  case  of  the  Gaussian  window 

(7.2)  ga(t)  =  (20) 5  exp(— 7rai^)  ,  t  €  E  , 

and  for  the  one-sided  exponential  considered  in  7.5  below.  In  [30],  Sec.  3,  the  Wexler-Raz 
biorthogonality  condition  (5.22)  for  g  g^  h  written  out  and  elaborated  to  yield,  for  any 
e  >  0  with  £  <  —  a6),  the  biorthogonal  function  (apart  from  a  constant  factor) 

(7.3)  7.,a(t)  =  (-l)^exp(^(l:+i-ftc5)')<i«, 

f  €  M  , 

where  c  =  {ae  +  ab)~^  >  1.  It  can  be  shown  that  this  can  be  extended  to  an  entire 
function  of  t  =  x-\-iy  e  C  satisfying 

(7.4)  7e,a(^  +  iy)  —  0{exp(-7Tax‘^  7t£~^y^))  ,  x,y  eR  . 

This  implies  that  both  and  have  Gaussian  decay,  just  like  g^  and  ga-  Such  a  7^,0  can 
also  be  constructed  by  using  the  Bargmann  transform,  see,  for  instance  [7],  [8].  Interestingly, 
when  0  =  1  and  we  take  e  |  0  we  obtain  a  function  70,1  that  would  coincide  with  Bastiaans 
singular  function  in  [4]  when  ab  =  1.  The  latter  singular  function  can  be  shown  to  be  in  any 
L^\LP  with  1  ^  p  <  00,  see  [10],  Subsec.  4.4. 
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The  result  that  {^,  a,  6)  is  a  frame  for  Gaussian  g  and  {ab)~^  >  1,  and  several  general¬ 
izations  of  it,  has  a  rich  history  for  which  we  refer  to  [16],  Subsec.  3.4.4.B,  [26],  pp.  980- 
982,  [30],  Sec.  1.  In  [36],  Lyubarskii  and  Seip  give  a  very  careful  and  detailed  analysis  of 
what  happens  for  Gaussian  g  and  (ab)~^  =  1  when  the  lattice  of  the  relevant  time-frequency 
points  is  slightly  disturbed. 

It  is  unlikely  that  any  of  the  in  (7.4)  coincides  with  the  canonical  dual  °7.  For  even 
values  of  {ab)~^  and  a  =  1  the  canonical  dual  °fa  was  computed  in  [37],  Sec.  6  as 


(7.5)  °7a(t) 


h  °° 

— z - Cfc  exp(-7r(t  -  k/b)^) , 

?93(7rt/a;  exp(-7r/2a2)) 

teR, 


with  a  theta  function,  see  [37],  (6.5),  and  Ck  certain  numbers  decaying  like  exp(-7r  |/c|/26^) 
as  I  A;  I  oo.  This  °7a  cannot  be  extended  to  an  entire  function  and  it  decays  only  like 
exp(-7r|i|/26)  as  t  €  E,  |i|  ^  oo.  For  non-integral  values  of  (a6)~^  it  does  not  seem  easy 
to  determine  the  canonical  dual  (even  the  case  that  (a6)“^  is  an  odd  integer  presents  serious 
problems). 


7.5. 


We  consider  next  the  one-sided  exponential 
(7.6)  a9{t)  =  (2a)  j  e-“‘  X[0,oo)(<) ,  t  6  R  , 


with  q:  >  0.  One  can  again  guess  a  dual  ^7  by  looking  at  the  Wexler-Raz  biorthogonality 
condition  (5.22),  and  one  obtains  as  a  dual  function 


(7.7)  alit)  =  e“‘(xio,a)(i)  -  X[-a,0)(i))  ,teM. 

We  note  that  this  ^7  works  also  for  the  case  that  {ab)~^  —  1,  whence  (a^,  a,  b)  is  a  Gabor 
frame  for  any  o  >  0,  6  >  0  with  {ab)~^  ^  1.  When  (ab)~^  is  an  integer,  one  can  compute  the 
canonical  dual  °7,  see  [37],  (4.11),  as 


(7.8) 


I'rit) 


^  ^  ^  pat-2aa\tl a\  ^ 

^  1  - 

x(X[0,l/6)W  -  6~^“'^^X[-1/6,0)(^))  ,  t  eR, 


and  this  °7  differs  from  the  ^7  in  (7.7)  (unless  (ab)~^  =  1). 

We  note  that  in  [37]  there  are  some  more  specific  examples,  such  as  two-sided  exponen¬ 
tials,  hyperbolic  secants,  for  which  it  is  shown  that  they  yield  a  Gabor  frame  with  integer 
(ab)~^  >  1. 
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7.6. 

While  1  is  the  lower  bound  for  (ah)~^  so  that  (g,  a,  b)  can  be  a  frame,  it  appears  that  the 
chances  of  having  a  frame  increase  with  increasing  value  of  For  integer  values  N  of 

(a6)“^  this  is  apparent  from  Sec.  6,  since  now  the  matrix  v)  is  a  scalar  (p  =  1,  g  =  N), 
given  by 

N-\  .  2 

(7.9)  A^o{t,u)  =  J2\iZ9){t--,i')\  ,a.e.<,veR, 

and  this  quantity  is  positive  and  bounded  for  many  windows  g,  including  certain  smooth  and 
rapidly  decaying  p’s.  More  precisely,  according  to  [26],  Theorems  2.5-6,  for  any  sufficiently 
well-behaved  g  there  are  ac  >  0,  6c  >  0  such  that  (p,  a,  6)  is  a  frame  when  0  <  a  < 

0  <  6  <  6c.  Also  see  [13],  Part  1,  Theorem  6.1.  Such  a  result  can  also  be  obtained  from 
the  frame  operator  representation  (5.15)  (holding  under  condition  A  in  (5.17))  when  (p,  g^^y) 
decays  sufficiently  rapidly  when  ^  oo.  Then  in  the  right-hand  side  of  (5.15)  the  terms 

with  (A:,  1)  ^  (0, 0)  are  small  compared  to  the  term  (a6)“^/,  corresponding  to  (k,  1)  =  (0, 0), 
as  o“^  -f-  6“^  — >■  00.  When  one  allows  p’s  that  are  not  well-behaved,  one  gets  problems  with 
results  of  this  type.  In  [38]  there  is  constructed  for  any  irrational  a  a  smooth,  bounded  g  £ 
such  that  for  any  rational  a  >  0,  6  >  0  the  system  (p,  a,  6)  has  a  finite  frame  upper  bound 
while  for  any  P  >  0  and  any  rational  c  >  0  the  system  (p,  ca,  /?)  has  no  such  bound.  Also 
in  [38]  there  is  an  example  of  a  p,  bounded  and  supported  by  [0, 1],  such  that  the  above 
be  do  not  exist,  and  an  example  of  a  p  such  that  0  is  accumulation  point  of  points  a  such  that 
(p,  a,  6)  has  frame  lower  bound  0  and,  at  the  same  time,  accumulation  point  of  points  a  such 
that  (p,  a,  6)  has  frame  lower  bound  ^  1  (arbitrary  6  €  (0, 1)). 

in. 

It  seems  hard  to  find  a  general  condition  on  windows  pGl/^  (with  reasonable  smoothness 
and  decay  properties)  ensuring  (p,  o,  6)  to  be  a  frame  for  all  or  even  some  specific  values  of 
a  >  0,  6  >  0  with  {ab)~^  >  1.  In  [39]  two  such  classes  of  windows,  for  integer  values  of 
(a6)“\  are  found.  The  first  class  consists  of  all  p  supported,  positive,  strictly  decreasing  and 
continuous  and  integrable  on  [0,  oo),  for  which  (a6)"^  is  allowed  to  take  any  positive  integer 
^  1  as  its  value.  This  class  contains  the  one-sided  exponentials  in  7.5.  The  second  class 
consists  of  all  even,  positive,  continuous  and  integrable  p  such  that  p  has  on  [0,  oo)  the  form 

(7.10)  g(t)  =  b(t)  +  b{t -\- 1)  , 

with  6  strictly  convex  and  positive  on  [0,  oo).  In  these  classes,  the  respective  conditions  of 
strictness  cannot  be  weakened  yielding,  for  instance,  a  well-behaved  rapidly  decaying  p  such 
that  both  p  and  p  are  strictly  positive  while  (p,  1)  is  not  a  frame.  We  also  observe  that  the 

Gaussians  do  not  belong  to  either  one  of  these  two  classes. 
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7.8. 


An  example  of  a  ^  that  just  fails  to  belong  to  the  first  class  in  7.7  is  the  characteristic 
function  X[o,c)  of  the  interval  [0,  c).  Assuming  without  restriction  that  6  =  1  the  author  has 
obtained  the  following  results  for  this  g.  The  frame  upper  bound  condition  is  always  satisfied 
in  this  case,  and  we  can  restrict  to  a  <  1,0  1  for  it  is  not  difficult  to  show  that 

(a)  c  <  a  or  a  >  1  no  frame, 

(b)  a  <  c  <  1  frame, 

(c)  a  —  1,  c  >  1  no  frame. 

Assuming  a  <  1,  c>  1  one  can  show  furthermore  that 

(d)  a  ^  Q,  c  G  (1, 2)  frame, 

(e)  a  =  p/q  G  Q,  gcd(p,  o')  =  1,  2  —  ^  <  c  <  2  no  frame, 

(f)  (2>|,  c  =  L-  l  +  L(1  -  a)  with  integer  L  >  3  no  frame, 

(g)  |c  -  [cj  -  ||  <  i  -  a  =»  frame. 

While  (f)  shows  that  one  may  fail  to  have  a  frame  for  irrational  a,  it  seems  to  hold  that 
one  does  have  a  frame  when  a,  c,  o/c  are  irrational.  And  for  rational  a  =  p/q,  gcd(p,  g)  =  1, 
one  can  give  an  algorithm  that  determines  whether  one  has  a  frame  for  any  c>  1  (complexity 
determined  by  g,  p).  In  particular  one  thus  can  find  rational  a  >  ^  and  c  >  1  such  that  one 
has  a  frame. 

This  shows  that  the  answer  to  the  basic  question  is  already  rather  bewildering  for  windows 
g  as  elementary  as  a  characteristic  function. 

7,9. 


We  continue  by  giving  some  comments  on  inheritance  of  certain  desirable  properties  of 
a  g  G  for  which  (g,  a,  b)  is  a  frame  by  the  canonical  dual.  We  recall  from  Subsec.  4.1  the 
CC-condition  of  essential  boundedness  of  |ez(i)|,  with  given  by  (4.22),  guaranteeing 
the  Walnut  representation  (4.21)  of  the  frame  operator  to  converge  unconditionally  as  an 
operator  of  L^.  It  is  an  open  problem  whether  inherits  the  CC-condition  from  g.  In  [28], 
Theorem  4.14,  it  is  shown  that  for  rational  values  of  (tt6)“^  a  slightly  stronger  condition,  viz. 
the  uniform  CC-condition,  is  inherited  by  °7  from  g. 

A  similar  situation  occurs  for  condition  A  of  Tolimieri  and  Orr,  see  (5.17).  While  it  is  an 
open  problem  whether  this  condition  A  is  inherited  for  all  values  of  (a6)"^  >  1,  it  has  been 
shown  in  [32]  that  it  does  so  for  rational  values  of  (ab)~^. 

7.10. 

The  next  topic  concerns  the  construction  of  a  tight  frame  canonically  associated  to  a 
Gabor  frame.  Given  a  Gabor  frame  (g,  a,  b)  with  frame  operator  Sg,  the  Gabor  frame  (/i,  a,  b) 
with 

h  =  Sg^  g 


(7.11) 
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is  tight.  It  should  be  noted  that  this  h  can  be  computed  using  the  functional  calculus  (with 
(p{x)  =  x~  2)  for  the  frame  operator  in  the  various  domains  (see  the  comments  after  Theo¬ 
rems  3,  10  and  14).  For  instance,  when  (ab)~^  —  q/p  is  rational,  the  h  in  (7.11)  is  given  in 
the  Zak  transform  domain  according  to 

(7.12)  1/)  —  j  a-e.  G  R  . 

In  the  particular  case  that  (ah)~^  is  an  integer  N  (so  that  q  =  N,p=  1)  we  get 

(7.13)  #'*{«,  u)  = - ^  ^  ^  ^  jy  e  R  , 

i.e. 

(7. 14)  (Zh)  (i,  I/)  = - {ZgKt,’') - ^  ,  a.e.  t,  1/  G  R  . 

(Er=o'|(^9)(i  +  i,^)|)^ 

7.11. 

A  further  interesting  point  is  the  problem  of  finding  out  whether  certain  decay  and  smooth¬ 
ness  properties  of  a  ^  generating  a  Gabor  frame  are  inherited  by  the  canonical  dual  °7  or  by 
the  tight  frame  generating  h  of  (7.1 1).  In  [23],  see  Properties  5.5-6,  it  is  shown,  by  using  Ba¬ 
nach  algebra  methods,  that  when  g  G  S  generates  a  Gabor  frame,  then  both  °7  and  of  (7. 1 1) 
are  in  S  as  well.  A  different  proof  of  this  fact,  as  pointed  out  by  K.-H.  Grbchenig,  can  be 
based  on  the  frame  operator  representations  of  Sec.  4  together  with  a  result  of  S.  Jaffard  [40] 
on  inheritance  of  decay  of  the  off-diagonal  elements  of  an  invertible  operator  of  /^(Z)  by  the 
inverse  and  the  inverse  square  root  of  the  operator.  For  inheritance  of  exponential  decay  in 
the  time  and/or  frequency  domain  the  latter  approach  has  been  worked  out  in  [41],  Secs.  2 
and  4. 

There  are  some  more  results  in  [41]  on  inheritance  of  smoothness  and  decay  by  °7  and 
h  from  g.  For  instance,  it  is  shown  that  in  the  case  of  integer  values  of  there  is  no 

inheritance  of  faster-than-exponential  decay  unless  we  are  dealing  with  tight  frames.  Whether 
there  do  exist  tight  frames  (p,  a,  6)  with  a  g  having  faster-than-exponential  decay  in  both  the 
time  domain  and  the  frequency  domain  is  an  open  problem. 


We  finally  make  some  comments  on  discrete-time  Gabor  systems.  We  now  take  N,M  e 
N  and  g  G  /^(Z),  and  we  consider  the  system  of  sequences  gnN,m/M  with  n  e  Z,  m  = 
0, 1, ...,  M  -  1 ,  defined  by 


(7.15) 


9nN,m/M  =  -  nN)Ux  ■ 
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It  turns  out,  see  [27],  Sec.  1.6,  that  the  theory  of  discrete-time  Gabor  systems  with  respect  to 
frame  bound  conditions,  frame  operator  representation,  the  duality  condition  and  characteri¬ 
zation  and  computation  of  canonical  dual  systems  can  be  developed  in  largely  the  same  way 
as  for  continuous-time  Gabor  systems. 

A  basic  question  that  arises  here  is  whether  one  can  obtain  a  discrete-time  Gabor  frame 
by  sampling  the  windows  of  a  continuous-time  Gabor  frame  (p,  N,  1/M)  at  the  integers,  and 
how  the  respective  frame  operators,  representations,  canonical  duals  etc.  are  related.  In  [42] 
it  is  shown  that  (and  how)  this  can  be  done  under  a  decay  condition  and  a  regularity  condition 
on  the  window  g.  These  conditions  are  the  condition  A  of  Tolimieii  and  Orr,  see  (5.17),  for 
the  system  (p,  AT,  1/M)  and  the  regularity  condition 


5 

It  is  shown  in  [42]  that  when  (p,  AT,  1/M)  is  a  Gabor  frame  with  p  satisfying  A  and  /?,  then 
the  canonical  dual  function  °7  satisfies  A  and  R  as  well  (so  that  it  can  be  sampled  at  the 
integers,  just  like  p).  Furthermore,  denoting  the  sampling  operation  by  a  superscript  D,  we 
have  that  (p^,  A^,  1/M)  is  a  discrete-time  Gabor  frame  with  the  same  frame  bounds  as  the 
continuous-time  frame.  Denoting  the  frame  operator  of  (p^,  N,  1/M)  by  we  have  for  a 
large  class  of  A  €  (including  p  and  °7)  that 

(7.17)  {Sghf  =  S°h’=’ . 

In  particular  we  get  that  °7^  is  the  canonical  dual  sequence  for  the  discrete-time  Gabor  system 
(p^,iV,l/M). 
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Abstract,  I  describe  the  work  of  Olevskit,  Tao  and  others  on  the  rearrangement  of 
orthogonal  series.  It  turns  out  that  arbitrary  rearrangements  produce  trouble  for  all 
orthogonal  and  wavelet  methods,  that  decreasing  rearrangements  produce  trouble  for 
Fourier  series,  but  that  wavelet  expansions  continue  to  work  well  under  decreasing 
rearrangement. 


1.  Introduction 


In  this  paper  we  shall  move  without  comment  between  the  circle  T  =  R/Z,  the  closed 
interval  [0, 1]  and  the  half  closed  interval  [0, 1)  as  seems  most  convenient.  When  our  results 
deal  with  almost  everywhere  behaviour  this  does  not  create  any  problems,  when  we  want 
everywhere  convergence  readers  may  have  to  produce  their  own  argument  to  deal  with  the 
point  1. 

What  do  we  mean  when  we  consider 

oo 

/  {u)  exp  27rm^  ? 

u=—oo 


Traditionally  we  take 

N 

lim  7  f(u)  exp  27riut 

jV-j-oo  ' 
u=-N 

and  the  only  variations  that  we  allow  concern  the  mode  of  convergence  (pointwise,  I/,  etc).  A 
sign  that  this  may  be  too  narrow  an  approach  appears  when  we  consider  the  two  dimensional 
case  of  a  function  /  :  C.  The  obvious  way  to  proceed  is  to  take 

Ef(u,  v)  exp  27ri(ut  +  vs)  —  lim  f(u,  v)  exp  27ri{ut  4-  vs) 

(«,u)€Z2  iu,v)eT{N) 
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with  the  r{N)  finite  subsets  of  such  that  r(l)  C  r(2)  C  . , .  and  Un=:1 
However,  it  is  well  known  that,  even  when  /  is  quite  well  behaved,  different  choices  of  the 
sequence  r(A^)  give  rise  to  different  behaviour. 

The  question  of  ‘correct  order’  also  arises,  even  in  the  one  dimensional  case,  from  signal 
processing.  If  we  seek  to  store  and  reconstruct  a  function  /  :  T  ->  C  by  using  its  Fourier 
coefficients  it  is  natural  to  to  use  them  in  decreasing  order  of  magnitude  and  to  consider 

/ (u)  exp  ‘I'Kiui 

rather  than 

f{u)expiut. 

\u\^N 

Any  easy  optimism  about  rearrangements  is  quenched  by  the  following  result. 

Theorem  1.  There  exists  a  real  f  G  L^(T)  and  a  bijection  a  :  Z  Z  such  that 


for  almost  all  f  6  T. 

This  theorem  was  first  stated  by  Kolmogorov.  A  proof  of  Kolmogorov’s  statement  was 
sketched  by  Zahorskii,  given  in  detail  by  Ulyanov  and  much  simplified  by  Olevskii.  The 
reader  who  consults  [7]  will  find  an  excellent  bibliography. 

Polya  says  that  sometimes  the  easiest  way  to  prove  a  result  is  to  generalise  it  and  prove 
the  generalisation.  By  the  time  Kolmogorov’s  theorem  reached  Ulyanov  and  Olevskii  it  had 
taken  the  following  form. 

Theorem  2.  Let  0i,  (j)2,  ♦  •  ^form  a  complete  orthonormal  system  in  L^([0, 1)).  Then 

there  exists  a  real  f  G  L^(T)  and  a  bijection  cr  :  N  — >  N  such  that 

N 

limsup  y^/(a{w))<^^(„){f)  r=oo. 

for  almost  all  t  e  [0,1). 

Here  as  usual  we  write 

f(<l>r)  =  {f,4>r)  =  [ 

Jo 

In  fact  Olevskii  improved  Theorem  2  by  replacing  /  G  L^(T)  by  /  continuous.  We  shall 
prove  a  result  which  includes  this  as  Theorem  5. 

In  the  years  since  Olevskii  published  his  result,  more  general  systems  than  those  of  com¬ 
plete  orthonormal  type  have  assumed  practical  importance. 
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DEnNITION  3.  We  say  that 

<t>u  02,  03,  •  •  • 

form  a  Riesz  basis  for  L^([0, 1))  if  the  linear  span  of  the  <t>n  is  dense  in  L?'  and  there  exists  an 
A  with  i4  >  1  such  that 

OO  00  ^  00 

n=0  n=0  2  ^^=0 

We  call  A  the  Riesz  constant  of  the  system.  Easy  functional  analysis  reveals  the  following 
lemma. 

Lemma  4.  Let  <j)i,  (j)2,  03, . .  .form  a  Riesz  basis  for  L^([0, 1)).  Then  there  exists  a  unique 
sequence  '0i,  '02.  03.  •  •  •  of  bounded  norm  such  that  1  (0^’  /)  P  <  oo  and 

OO 

/  =  5Z{V'n,/)<;in 

n=0 

for  every  f  el?.  If  f  =  a„(^„  then  a„  =  (V'n,  f)for  all  n. 

We  write /(n)  =  (Vn,/)- 

As  Olevskii  indicates  very  clearly  his  method  can  be  extended  to  Riesz  bases. 

Theorem  5.  Let  01,  02.  03.  •  •  -form  a  Riesz  basis  for  L^([0, 1]).  Then  there  exists  a  real 
continuous  f  and  a  bijection  a  :  N  ^  N  such  that 

N 

limsup  y'/(<7(w))0^(„)(i)  =oo. 

u=0 

for  almost  all  t  €  [0, 1]. 

If  we  ask  for  which  complete  orthonormal  system  Theorem  2  is  least  plausible  one  answer 
would  be  the  Haar  system.  Set 

A{N)  =  {(r,  5)  6  :  0  <  s  <  2^  —  1  and  AT  ^  r  ^  0} 

and  A  =  UAr>o  '^(•^)*  Consider  intervals  of  the  form 

^(r,  5)  =  [52-  (5  +  1)2-) 

where  (r,  5)  6  A.  We  define 

Xr,s(i)  =  1  ifte  £'(r  +  1,25) 

Xr,s{^)  =  -1  if  ^  C  E{r  +  1, 25  +  1) 

Xr,s(^)  =  0  Otherwise 

where,  again,  (r,  5)  G  A.  We  call  the  Xr,s  together  with  the  function  1  =  x-1,0  the  Haar 
system.  We  call  the  Tr,s  =  2’-/^Xr,s  together  with  the  function  1  the  normalised  Haar  system. 
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It  is  well  known  that  the  normalised  Haar  system  is  a  complete  orthonormal  system.  If  we 
put  the  standard  order 

(r,  s)  <C  (r',  s')  if  r'  >  r  or  r  =  r'  and  s'  s 

on  the  Haar  system  it  enjoys  remarkable  convergence  properties.  For  example  if  /  is  contin¬ 
uous 

/(«,  v)T„,„  -)■  / 

(«,v)<(r,s) 

uniformly  as  we  allow  (r,  5)  to  increase  so  as  to  exhaust  the  system. 

If  we  can  prove  Theorem  2  for  the  Haar  system,  one  is  tempted  to  say,  we  can  surely  prove 
it  for  any  orthonormal  system.  Olevskii  showed  that  this  is  indeed  the  case  and  we  shall  see 
that  once  we  have  Theorem  2  for  the  Haar  system  we  can  obtain  the  rest  of  the  theorem  from 
this  particular  case.  As  we  might  expect  on  general  grounds  the  key  to  the  Haar  system  case 
lies  in  a  finite  version  of  the  theorem. 


Lemma  6.  e  >  0  and  K  ^  Ibe  given.  Then  we  can  find  a  bijection 
ct:  {1,2,3,.. .,2"«-l}^A(iV) 
and  real  ar^s  s)  €  A(Ar)]  such  that 


for  all  i  €  [0, 1)  but 


{r,s)eA{N) 


for  allt  ^  E  where  E  is  a  set  of  measure  at  most  t. 

The  next  section  is  devoted  to  the  proof  of  this  key  lemma.  Once  it  is  understood  the  rest 
is  relatively  routine. 


2.  Olevskii’s  Lemma 

We  shall  need  two  simple  results,  the  first  combinatorial  and  the  second  probabilistic. 

Lemma  7.  If  01^02, . . . ,  ^  0  and  am+i^cim+2,  ■ . . ,  a^v  ^  0  then 

3  .  N 

Xy 


max 
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Proof.  Observe  that 

N  m  N 

|o^ I  =  2  ^  3  max 

r=l  r=l  r=l 

□ 

Lemma  8.  Let  Xi,  X2, . . . ,  Xf^  he  independent  identically  distributed  random  variables 
with  Pr(A^r  =  1)  =  Pr(Xr  =  — 1)  =  1/2  and  let  K  be  an  integer  with  K  ^  1.  Then 

(i)  Pr  ^  if)  <  2Pr  (E^Li  >  A")  • 

(ii)  Pr  (maxi^,^;,  E^=i^r  ^  /c)  <  4Pr  (|Ef=i^r|  ^  k)  . 

(iii)  Pr  ZLiXr  ^  if)  <  2NK-^. 

Proof.  (i)  This  is  the  simplest  form  of  the  reflection  principle.  (See  e.g.  [2]  Chap¬ 
ter  3.) 

(ii)  Use  symmetry. 

(iii)  Use  Chebychev’s  inequality  to  bound  Pr  ^  . 

□ 

Of  course,  we  can  get  much  better  estimates  in  (iii)  but  the  only  use  I  can  And  for  such 
estimates  is  to  study  the  Hausdorff  dimension  of  the  exceptional  set  of  convergence  in  Theo¬ 
rem  2  when  applied  to  Haar  functions. 

By  using  the  standard  interpretation  of  Haar  functions  in  terms  of  coin  tossing,  Lemma  8 
gives  the  following  result. 

Lemma  9.  IfN  and  K  are  strictly  positive  integers  then  we  can  find  Op^g  taking  the  values 
0  or  1  [(p,  q)  €  A(A^)]  and  a  set  E  of  measure  at  most  ANK~‘^  such  that 

^p,qXp,q{'l')  ^  X 

{p,q)eA{N) 

for  all  t  e  [0, 1),  but 

\^p,qXp,q{i^)\  —  X 

{p,q)eA{N) 

for  all  t  ^  E. 

Proof.  Consider  (0, 1)  with  Lebesgue  measure  as  a  probability  space.  Let 

Xj{t)  =  (-1)P^‘I 
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(where  [2^t]  is  the  integer  part  of  2H).  The  Xj  satisfy  the  conditions  of  Lemma  8.  It  follows 
by  Lemma  8  (iii)  that  the  set 


has  measure  at  most  AN K 

To  define  ap^g  we  look  at  the  interval  E(p,q)  =  (p  +  1)2"'^)  and  observe  that 

Yji=i  ^r{i)  is  constant  on  E{p,  q)  for  all  1  ^  j  ^  ^  —  1.  We  may  thus  define  ap^q  =  1  if 


max 


3 


-I 


for  t  e  E{p,  q),  and  ap^q  =  0  otherwise. 

If  we  now  set 

^p,gXp,q{^) 

(p,g)eA(A^) 

then  Y  is  the  random  variable  defined  by 


r=l 


if  I  maxi^j^iv  EJ=:i  Wl  ^  N  but 

m 

Y(t)  =  J2Mt) 

r=l 

where  j{t)  is  the  smallest  j  with  |  XlrS  =  E,  In  more  vivid  terms,  toss  a  fair  coin 

keeping  track  of  the  difference  between  the  number  of  heads  and  tails  thrown.  If  this  ever 
takes  the  value  K  or  ~K  stop  and  record  the  value  as  Y.  If,  after  N  throws  this  has  not 
happened,  record  the  value  after  N  throws  as  Y.  By  definition  |F (t)\  K  and  if  t  ^  E  (so 
we  complete  the  N  throws) 

^  ^  \^p,qXp,q{^)\  ~  E 

(p,9)€A(N) 

as  required.  □ 


Now,  instead  of  taking  the  sequence  of  heads  and  tails  as  chance  presents  them,  we  wish 
to  take  all  the  heads  first  and  then  all  the  tails.  To  do  this  we  introduce  the  Olevskii  order  on 
A 


(p',g')  t  (p,q)  if  W  +  1)2-^  ^  (2p  +  1)2-’. 
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The  Olevskii  order  is  more  intricate  than  may  at  first  appear  and  the  reader  should  try  ordering 
the  (p,  q)  with  (p,  q)  €  A(A^)  for  =  5  or  iV  =  6.  Observe  that  (p',  q')  y  x(p,  q)  if  the  mid¬ 
point  of  supp  Xv',q'  is  to  the  right  of  (i.e.  greater  than  or  equal  to)  the  mid-point  of  supp 
We  observe  that  Xp,q{i)  ^  0  if  t  is  strictly  to  the  left  of  (i.e.  strictly  less  than)  the  mid-point 
of  supp  Xv,q  Xp,9(*)  <  0  if  t  is  to  the  right  of  (i.e.  greater  than  or  equal  to)  the  mid-point 
of  supp  Xp,g-  It  follows  that  for  each  t  e  [0, 1)  there  is  a  A(i)  e  A(A')  such  that 

Xp,g{t)  <  OforA  h  {p,q),  Xp,g{t)  >  0for(p,g)  h  Xa- 
Using  this  observation  we  easily  arrive  at  the  result  we  require. 


Lemma  10.  Suppose  that  N  and  K  are  strictly  positive  integers  and  and  E  are  as  in 
Lemma  9.  Then 


max 

X£A{N) 


^P,qXp,q{'l^) 

^t(p>q) 


>N/3 


for  all  t  ^  E. 


Proof.  By  Lemma  7  and  the  last  sentence  of  the  previous  paragraph 

I  Zl  -  (P’  <iy,^Xp,M  \>\  (<)  I  =  N/3 


(p,?)6A(W) 


for  all  t  ^  E. 


□ 


Theorem  11.  Let  e  >  0  and  k.  >  1  be  given.  Then  we  can  find  an  N  ^  I,  1  h  >  3, 
bp^g  taking  the  values  0  or  b[{p,q)  G  A  (AT)]  and  a  set  E  such  that 

(i)  I  £{p,g)eA(JV)  ^p,qXp,q{'^)\  ^  I  for  all  t  €  [0, 1), 

(ii)  maxxeA{N)  \  Eax(p,9)  W I  ^  allt^E 

(iii)  \E\  <  e. 


Proof.  Choose  an  integer  M  with  M  >  4e”^  and  >  3k.  Set  N  —  M®,  K  = 
and  choose  Op^g  and  E  as  in  Lemma  9.  If  we  now  put  bp^g  —  ap^g/K,  b  =  IJK  then  all  the 
conclusions  of  the  lemma  with  the  exception  of  condition  (ii)  follow  at  once  from  Lemma  9. 
Condition  (ii)  itself  follows  from  Lemma  10.  □ 


Standard  ‘rolling  hump’  (or  ‘condensation  of  singularities’)  methods  now  give  the  fol¬ 
lowing  result. 


Exercise  12.  Let  Xn  ^  N]  be  the  Hoar  system  enumerated  in  some  way.  Then  there 
exists  a  real  f  e  L°°([0, 1))  and  a  bijection  cr :  N  — N  such  that 


lim  sup 

TV -400 


N 

Y^fHu))Xp(u){t) 

ti=0 


—  oo. 


for  almost  all  t  G  [0, 1). 


no 
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We  leave  it  as  an  exercise  for  the  reader  because  we  shall  prove  stronger  results.  However 
these  results  will  require  extra  complications  in  their  proofs  and  the  reader  may  find  the  more 
complex  proofs  easier  to  follow  if  he  or  she  has  worked  through  an  easier  case. 


3.  Extension  To  General  Spaces 

We  have  not  yet  exhausted  the  strength  of  Polya’s  dictum.  What  happens  if  we  seek  to 
extend  Theorem  2  to  more  general  measure  spaces?  A  moment’s  reflection  brings  to  mind 
the  the  fact  that,  so  far  as  measure  theory  is  concerned,  all  nice  measure  spaces  which  are 
not  obviously  different  are  the  same.  Recall  that  a  measure  space  (X,  fi)  with  positive 
measure  fi  is  non-atomic  if  given  E  e  T  with  ij,{E)  >  0  we  can  find  F  e  E  with  F  C  E  and 
IJ,{E)  >  fJi{F)  >  0.  The  following  result  is  typical  (see  e.g.  theorem  9,  page  327  of  [8]). 

Theorem  13.  Let  X  be  a  complete  separable  metric  space  equipped  with  Bx  its  o- 
algebra  of  Borel  sets  and  p  a  non-atomic  measure  on  Bx-  If  I  =  [0, 1]  is  the  unit  interval 
with  its  usual  metric,  Bj  its  a-algebra  of  Borel  sets,  and  m  is  the  usual  Lebesgue  measure 
then  there  exists  a  bijective  map  F  :  I  X  such  that  F  and  F~^  carry  Borel  sets  to  Borel 
sets  of  the  same  measure. 


We  shall  not  use  Theorem  13  but  we  shall  use  the  lemma  which  underlies  its  proof  and 
the  proof  of  results  like  it. 

Lemma  14.  Let  (X,  F,  p)  be  a  non-atomic  probability  space.  IfE^F  and  1  ^  o  ^  0 
then  we  can  find  F  E  F  with  E  D  F  and  p{F)  =  ap{E). 


Lemma  17  follows  easily  from  a  lemma  of  Saks  given  as  Lemma  7  of  section  IV.9.8  of 
Dunford  and  Schwartz  [1].  There  is  a  discussion  of  isomorphism  theorems  in  Chapter  VIII  of 
Halmos’s  Measure  Theory  [3].  However  before  readers  rush  off  to  inspect  the  wilder  shores 
of  measure  theory  they  should  note  that  all  they  will  learn  is  that  they  could  have  stayed  at 
home,  since  [0, 1]  with  Lebesgue  measure  is  the  type  of  all  non-atomic  measure  spaces. 

In  view  of  the  preceding  discussion,  it  is  natural  to  aim  at  the  following  generalisation  of 
Theorem  2.  (The  extension  of  Definition  3  to  general  probability  spaces  is  obvious.) 


Theorem  15.  Let  (X,  F^  p)  be  a  probability  space  with  p  non-atomic.  Let  (pi,  (p2,  be 
a  Riesz  basis  in  L^(X).  Then  there  exists  a  real  f  G  L‘^{X)  and  a  bijection  cr  :  N  -)■  N  such 
that 

I  N  I 


limsup  y^/(cr(M))0„(„)(t) 


—  00. 


for  almost  all  t  E  X. 


The  result  is  clearly  false  if  p  is  not  non-atomic.  Suppose  E  E  F  is  an  atom,  that  is 
p{E)  >  0  and  if  F  E  F  with  F  C  E  then  p{F)  =  p{E)  or  p{F)  —  0.  Then  if  Qn^  9  ^ 
and  ll^n  —  5II2  — 0  it  follows  that  gn{t)  — )■  g{t)  for  //-almost  all  t  E  E. 


T.W.  Komer  /Does  Order  Matter 


111 


It  is  not  hard  to  obtain  Theorem  15  from  Theorem  2  by  the  standard  argument  used  to 
prove  results  like  Theorem  13.  However,  it  is  more  interesting  to  ask  how  we  might  tackle 
Theorem  15  directly  and,  in  particular,  what  we  are  to  make  of  Theorem  11  in  the  general 
context.  If  we  do  so  we  are  rewarded  with  a  key  insight  —  Theorem  1 1  is  a  combinatorial 
theorem. 

Theorem  16.  Let  (X,  jL)  be  a  probability  space.  Let  e  >  0  and  k  >  Ibe  given.  Then 
we  can  find  1^6>0  and  bp^g  taking  the  values  Oorb  [(p,  q)  e  A(A^)]  with  the 

following  property. 

Suppose  that  we  have  a  collection  of  sets  En^r  ^  ^  such  that 

(A)  E{2r  —  1,  n  4- 1)  n  E{2r,  n  +  1)  —  0,  E{2r  —  1,  n  +  1)  U  £’(2r,  n  +  1)  =  E{r,  n)for 
all  (r,  n)  G 

(B)  \E(r,n)\  =  2-^  for  all  (r,n)  G  A^v+i- 
Let  us  set 

Hr, nit)  =  1  forte  E{2r  -  1,  n  +  1), 

^r,nit)  —  -I  fort  e  E{2r, n  +  1), 

^r,nit)  —  0  Otherwise, 
whenever  (r,  n)  G  Ajv.  Then 

W  I  Y^{p,q)e^{N)  9(^)1  ^  I  for  all  t  G  [0, 1), 

and  there  is  aset  E  e  T  such  that 

(ii)  max^^A(/9)  |  Ylxy{p,q)  ^p,Q^pyqit)\  ^  k  for  all  t  ^  E 

(iii)  |£;|  <  e. 

Proof.  This  is  just  Theorem  11.  □ 

To  see  why  this  is  indeed  an  insight  let  us  return  to  the  concrete  case  of  Lebesgue  mea¬ 
sure  on  [0, 1)  and  consider  the  most  important  Riesz  basis  of  all,  the  exponentials  e„(t)  ~ 
exp(27rini).  If  we  try  to  convert  Theorem  11  into  a  result  on  Fourier  series  we  run  into  the 
problem  that  different  Haar  functions  do  not  ‘occupy  different  parts  of  the  frequency  spec¬ 
trum’,  (Indeed  the  Fourier  coefficients  of  the  Haar  fonctions  Xp,q  with  fixed  p  are  all  of  same 
amplitude  differing  only  in  phase.)  However,  if  we  ‘shuffle  [0, 1)’  the  Ep^g  can  be  chosen  so 
that  the  Hp^g  are,  for  practical  purposes,  in  ‘different  parts  of  the  frequency  spectrum’.  That 
is,  although  we  do  not  have  the  ideal  outcome  in  which  at  most  one  of  the  Hp,g{j)  is  non-zero 
for  each  j,  we  can  arrange  that  at  most  one  of  the  Hp,gij)  is  large  for  each  j. 

Since  there  seems  no  further  advantage  in  considering  general  measure  spaces  we  shall 
stay  within  [0, 1)  and  [0, 1].  However,  readers  who  wish  to  work  more  generally  will  find  that 
they  only  need  the  following  simple  consequence  of  Lemma  14. 

Lemma  17.  Let  {X,  T,  fi)  be  a  non-atomic  probability  space.  If  F  e  T  and  p(F)  >  0 
we  can  find  a  sequence  ej  of  orthogonal  functions  and  sets  Fj  G  T  with  Fj  C  F  such  that 
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(i)  ej{t)  —  I  fort  £  Fj,  ej{t)  =  -1  fort  £  F\  Fj,  ej(t)  =  0  otherwise, 

(ii)  =  ii{F)l2. 

Proof,  Set  £’(0,0)  ==  F.  By  repeated  use  of  Lemma  14  with  a  =  1/2  we  can  find 
En^r  ^  F  such  that 

(A)  £(2r  -  1,  n  +  1)  n  £(2r,  n  -j- 1)  =  0,  E{2r  -  1,  n  -f  1)  U  £(2r,  n  +  1)  —  £(r,  n)  for 
all  1  <  r  ^  2”, 

(B)  |£(r,n)|  =  2-”|F|  for  all  1  ^  r  <  2". 

Now  set  Fj  =  Urli  F{2r  -  1,  j  +  1)  and  define  ej  as  in  condition  (i).  □ 

4.  Extension  to  general  Riesz  bases 

In  this  section  we  work  in  [0, 1]  with  Lebesgue  measure.  In  order  to  put  into  effect  the 
programme  sketched  at  the  end  of  the  last  section  we  need  a  sequence  of  easy  lemmas.  It  may 
be  helpful  for  the  reader  to  keep  in  mind  as  examples  both  the  ‘well  behaved’  orthonormal 
system  of  exponentials  and  some  other  system  where  ^ 

Lemma  18.  Let  01,  </>2,  ...be  a  Riesz  basis.  If  Ci,  62, . .  .form  an  orthonormal  sequence 
then 

ek(j)  ->  0 

as  k  00  for  each  fixed  j. 

Proof.  Referring  back  to  Lemma  4  we  see  that  e/t(j)  =  and  the  result  is  obvi¬ 
ous.  □ 

Lemma  19.  Let  (j)\,  (j)2,  ...be  a  Riesz  basis.  Suppose  that  5  >  0,  M  ^  0  and  that 
F  is  a  set  of  strictly  positive  measure.  Then  we  can  find  E  a  measurable  subset  of  F  with 
|£;|  =  |£|/2  and  M'  >  M  such  that  if  we  set  H{t)  =  1  fort  £  E,  H(t)  =  —1  for  t  £  F\E 
and  H{t)  =  0  otherwise  then 

M  00 

j—\  j~M’ 

Proof.  Combining  the  results  of  Lemma  17  and  Lemma  18  we  see  that  we  can  find  E  a 
measurable  subset  of  F  with  |£|  —  l£|/2  such  that,  if  H  is  defined  as  stated, 

M 

j=l 

Since  H  £  L'^  there  exists  an  M'  >  M  such  that  Y1T=m'  so  we  are  done.  □ 

We  now  have  our  basic  construction. 
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Theorem  20.  Let  01,  (j)2,  ...be  a  Riesz  basis.  Suppose  that  J  >  0  and  N,  M'(0, 0)  ^  1 
are  given.  Then  we  can  find  integers  M(r,  n)  <  M'(r,  n)  with  M(r,  n)  >  M'(0, 0)~together 
with  a  collection  of  measurable  sets  E(n,  r)  such  that 

(A)  E{2r  ~  1,  n  +  1)  n  E{2r,  n  +  1)  -  0,  E(2r  -  1,  n  +  1)  U  E(2r,  n  +  l)  =  E{r,  n)for 
all  (r,n)  e  A^, 

(B)  \E{r,n)\  =  2'^  for  all  (r,n)  e  A^+i. 

(C)  if  we  write  A(r,n)  =  {k  :  M{r,n)  ^  k  ^  M'(r,n)}  then  A{r,n)  n  A{s,m)  =  0 
whenever  (r,  n)  ^  (5,  m), 

and  such  that,  if  we  set 

Hr,n{t)  =  1  forte  E{2t  -  1,  n  +  1), 

Hr,n{t)  =  -1  for  t  €  E{2r,  n  +  1), 

=  0  otherwise. 


whenever  (r,  n)  e  A^^  then 


Y,  \HrM)?<6. 

j^A(r,n) 


Proof.  Use  Lemma  19  repeatedly.  □ 

Theorem  21.  Let  e  >  0  and  k>  1  be  given  and  take  N  ^  l,b  and  bp^q  [(p,  q)  e  A{N)] 
be  as  in  Theorem  16.  Suppose  that  (j)i,  ^2,  •  •  •  is  a  Riesz  basis,  5  >  0  and  M'(0, 0)  ^  1  are 
given  and  that  £'(n,  r),  A(r,  n)  and  ore  constructed  as  in  Theorem  20.  Then  if  we  set 

(P,9)€A(iV) 


we  have 

(i)  ll/IU^l. 

Further,  provided  only  that  6  is  small  enough,  there  is  a  measurable  set  F  such  that 

(ii)  maxA6A(jv)  I  Ea^Cp.,)  EjeA(p,,)  l/0>i(*)l  >  «  -  I  for  all  t  i  F, 

(iii)  |i?|  <  2€. 
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Proof.  Conclusion  (i)  is  just  conclusion  (i)  of  Theorem  20.  To  obtain  (ii)  and  (iii)  we 
proceed  as  follows.  Observe  that 

E  E  -  E 

<  E  E  E 

j^A(p,9)  jeA(p,g)  2 

<  E  E  ^p..-  E  • 

>^b(P.9)  i^A(p,9)  j€A(p,9)  2 

But,  for  each  {p,q)  €  A(A^), 

E 

jeA{p,<7)  2 


E 

+  E 

i^^A(p.9) 

2  (t-,s)?^(p,<7) 

jeA{p,q) 

E 

+  E 

j^A{p,q) 

2  (r,s)7^(p,qr) 

j^A(r,s) 

“  ^r,s(i)0j 

(r,s)€A(N)  ;^A(r,5)  2 

But,  writing  A  for  the  Riesz  constant  of  the  basis.  Theorem  20  tells  us  that 

2 

E  HrAi)<t>j  E  \Ks{i)?<AS. 

j^A{r,5)  2  f^A{r,s) 

Thus,  retracing  our  steps, 

Hp,,-  E  <2’'{A5Y'^ 

jeA{p,q)  2 


T.W.  Komer  /Does  Order  Matter 


115 


and 

E  E  -  E 

Ab(P.9)j€A(p,g)  \y{p,q)  2 

for  all  A  e  A(^'). 

If  we  now  write 

=  I  i  f{j)^j  ~  ^P,Q^P,Q  ^  1 

A^(p,<7)  J^A(p,<jf)  A^(p,g) 

then  Chebychev’s  inequality  and  the  last  inequality  of  the  preceding  paragraph  tell  us  that 

\Fx\  <  2^^A5. 

Thus  if  we  set  F  =  £;  U  UAeA(iV)  conclusion  (ii)  follows  from  conclusion  (ii)  Theorem  20 
whilst  conclusion  (iii)  Theorem  20  tells  us  that 

|F|  <  e  +  2^^ AS, 

and  (iii)  holds  provided  only  that  6  is  small  enough.  □ 

Theorems  20  and  21  give  us  what  we  want.  However,  we  have  accumulated  a  fair  amount 
of  notation  in  the  course  of  the  construction  which  we  can  now  jettison  to  provide  a  simpler 
conclusion. 

Theorem  22.  Let  <j)i,  <1)2,  ...be  a  Riesz  basis.  Given  K  ^  1  we  can  find  an  integer 
M{K)  ^  1  with  the  following  properties.  Given  any  integer  m  ^  1  and  any  77  >  0  we  can 
find  a  function  f  G  L°^{[0, 1]),  an  integer  m'  >  m,  a  bijection  cr  :  N  -)■  N  with  a{r)  -  r 
fori  and  for  r  ^  m’,  integers  m  <  p(l)  ^  p(2)  ^  p(3) . .  .p{M)  <  m'  and  a 

measurable  set  E  such  that 

(i)  I  I/I  loo  <  1, 

oi)Er=ii/o)p^'/.  _ 

(iii)  |  'E,f=l  ^  K  for  all  t  e  E, 

(iv)  |£'|  >  1  -  K-\ 

The  distance  from  i'”([0, 1])  to  C([0, 1])  is  usually  not  very  great.  The  present  case  is  no 
exception. 

Theorem  23.  In  Theorem  22  we  may  take  f  continuous. 

Proof.  This  is  entirely  routine.  Let  the  M[K)  of  our  new  Theorem  23  be  chosen  to  be 
the  M {2K  +  2)  of  the  old  Theorem  22.  Then,  by  Theorem  22,  given  any  m  ^  1  and  any 
7?  >  0  we  can  find  a  function  /  G  L°°([0, 1)),  an  integer  m'  >  m,  a  bijection  a  :  N  -)■  N  with 
a(r)  =  r  for  1  ^  r  <  m  and  for  r  ^  m\  integers  m  ^  p(l)  <  p(2)  <  p(3) , ,  .p(M)  ^ 
and  a  measurable  set  E'  such  that 
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(iii)  '  maxi^fc^M  |  EjS  f{^U))Mj)i'^)\  >  K  +  l  for  a[\te  E, 

(iv)  '  |£:'l  >  1  -  K-^/2, 

Let  J  >  0  be  a  small  number  to  be  determined  and  choose  /  E  C([0, 1))  such  that  /  satisfies 
condition  (i)  and  \  \f  -  g\\2  ^  S/A^.  Automatically 

/  oo  \  1/2 

so  condition  (ii)  is  satisfied  provided  only  that  we  choose  S  small  enough. 

Next  we  observe  that 

p(fc)  p(/i:)  /  p(fc)  \  ^ 


Thus  writing 

ifW  p(*)  ] 

i=l  3=1 

we  have,  by  Chebychev’s  inequality, 

p{k)  p{k) 

j=l  j=l  2 

Thus,  setting  E  =  E'\  Uf=i»  we  see  that  (iii)  holds  automatically  and 


\E\^\E'\-^\E,\^{l-n/2)-N6 


SO  that  (iv)  holds  provided  only  that  we  choose  S  small  enough. 


The  remainder  of  the  construction  follows  a  standard  rolling  hump  (condensation  of  sin¬ 
gularities)  pattern  using  Chebychev’s  inequality  in  the  same  way  as  in  the  two  previous 
proofs.  In  view  of  the  amount  of  notation  involved  readers  will  probably  prefer  to  do  it 
themselves  rather  than  follow  my  proof. 

It  is  easy  to  put  the  bricks  of  Theorem  23  together. 
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Lemma  24.  Let  (j)i,  (j)2,  ...be  a  Riesz  basis  with  constant  A,  Let  m'(0)  =  1  We  can  con¬ 
struct  inductively  positive  integers  M{n),  m{n),  m!{n)  with  m'(n  -  1)  <  m{n),  continuous 
functions  fn,  bijectionson  :  N  with an{r)  =  r/orl  <  r  ^  m'(n-l)  andform\n)  ^  r, 

integers  m  ^  p(l,  n)  ^  p(2,  n)  ^  p(3,  n) . .  .p(n,  M(n))  ^  a  measurable  set  E  such 

that 

(i)  B  ll/niu  ^  2-". 

(ii)  n  \fnir)?  ^ 

(iii)  „  I  /n(o'U))<A<r„(i)(*)l  ^  2'^  for  all  t  €  E, 

(iv)  „  |F„|  >  1  -  2-". 

(V)  n  E"=x  Er=™-w  l/xWP  <  +  l)-l 


Proof.  Conditions  (i)n  to  (iv)„  come  directly  from  Theorem  23.  The  key  point  is  that 
Theorem  23  allows  us  to  define  M(n  +  1)  before  we  define  m'(n)  in  such  a  way  as  to  satisfy 
condition  (v)n.  D 


Proof  of  Theorem  5.  As  I  said  above,  this  is  routine.  It  is  easy  to  check  that  a{r)  = 
cr„(r)  for  m'(n  -  1)  <  r  <  m!{n)  gives  a  well  defined  bijection  cr  :  N  ->  N.  By  the 
conditions  (i)„  =  Z)r=i  fr  converges  uniformly  to  a  continuous  function  /  as  n  — >  oo. 
Trivially 


p{kyn)  p{k,n) 

j=l  J  =  1 


^  Il^n-l||cx5  +  |G^fc,nWl 
^  1  +  |Gfc,n(i)|? 


where 


n— 1  oo  oo  p{k,n) 

Gk,n{t)  =  XI  +  X^  S 

r=l  j:=p(k,n)+l  r=n+l  j=l 
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Using  the  properties  of  Riesz  bases  together  with  conditions  (ii)(r)  and  (v)(r)  we  have 
ll^fc.nlb 

'|p(A:,n) 

i=l 


n— 1 

r=l 


j=p(fc,n)+l 
n-1  /  00 


^=n+l 

oc  /MM) 

r=l  \^j~p{k,n)-\-l  J  r=n+l  ^  j-1 

n-1  /  00  \  00  / m{n)  ^ 

^  A>/^E  E  \frW))\^  +  Y  Y  \fMjW 

r=l  \j-m'{n-l)  J  r=n+l  ^^=1 

n-1  /  00  \  oo  /m{r-l)  ^ 

E  l/rW:'))!'  E  E  l/>0))P 

r-l  \^j~rn'{n-\)  J  r=n+l  y  j=l 

oo 

r=n+l 

Thus  writing 

E{k,n)  =  {t :  |Gt,„(t)|  ^  1}, 

we  have,  by  Chebychev’s  theorem,  \E(k,  n)|  ^  2“’*M(n)“*  and,  if  we  set 

M(n) 

E'{n)  =  E{n)  (J  E(k,n), 


k=l 


we  have,  by 
and  by  (iii)n 


|^'(n)l^2- 


n-l-l 


max 

l^k^M{n) 


p(k,n) 

i=i 


>  2"-2 


for  all  t  €  B'(n).  The  theorem  follows. 


□ 


The  reader  may  readily  check  that  Theorem  15  may  be  obtained  by  very  similar  arguments 
as  can  the  following  direct  generalisation  of  Theorem  5 

Theorem  25.  Let  (X,t)  be  a  compact  Hausdorjf  space.  Let  [X,T,p)  be  a  regular 
non-atomic  probability  space.  Let  (^i,  (^2»  •  •  -form  a  Riesz  basis  for  L'^{p).  Then  there  exists 
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a  real  continuous  f  and  a  bijection  <7  ;  N  ->  N  such  that 


lim  sup 

N-^00 

for  almost  all  t  e  X. 


N 


u=0 


—  00. 


5.  Further  Questions 

It  is  natural  to  ask  if  our  results  can  be  improved  so  as  to  replace  ‘divergence  almost  ev¬ 
erywhere’  by  ‘divergence  everywhere’.  The  obvious  answer  is  no,  since  if  we  take  a  complete 
orthonormal  system  on  [0, 1)  and  define  by  ^n{i)  =  (l>n{t)  for  t  ^  0,  ^n(O)  —  0  the 
result  remains  a  complete  orthonormal  system  but  any  linear  combination  of  a  finite  set  of 
will  take  the  value  0  at  zero. 

The  “unfair”  example  just  given  means  that  we  must  reformulate  our  question  and  sug¬ 
gests  that,  at  least  initially,  we  should  consider  particular  complete  orthonormal  systems.  In 
the  case  of  Fourier  series  we  recall  the  remarkable  theorem  of  Kahane  and  Katznelson  that 
every  set  of  measure  zero  is  a  set  on  which  the  Fourier  sum  of  a  continuous  function  diverges 
(see  e.g.  [4],  Chapter  II,  Section  3)  and  consider  the  following  sequence  of  lemmas. 

Lemma  26.  Given  any  c  >  0,  any  K  >  1  and  any  integer  N  I  we  can  find  a 
trigonometric  polynomial  P,  a  set  E  which  is  the  union  of  a  finite  set  of  intervals  and  a 
bijection  a  ;  N  ^  Z  such  that 

(i)  ||P||oo  <  1, 

(ii)  P(n)  =  0  for  all  n  ^  N, 

(iii)  maxk^o  \  ^  K  for  all  t  ^  E. 

(iv)  |£;|  <  e. 

Proof.  Apply  de  la  Vallee  Poussin  summation  to  Theorem  23  (with  the  exponentials 
exp(27rinf)  as  the  Riesz  basis)  to  obtain  a  P  satisfying  all  the  conditions  except  possibly  (ii). 
To  obtain  (ii)  it  suffices  to  replace  P{t)  by  exp{iMt)P{t)  with  M  a  suitable  large  positive 
integer.  □ 

Lemma  27  (Kahane  and  Katznelson).  Given  any  K  >  1  there  exists  an  e{K)  with  the 
following  property.  Given  any  set  E  which  is  the  union  of  a  finite  set  of  intervals  and  has 
\E\  <  e(K)  and  any  integer  N  '^1  we  can  find  a  trigonometric  polynomial  P  such  that 

(i)  llPiloo  ^  1, 

(ii)  P{n)  =  0  for  all  n  <  A, 

(iii)  maxt^olE,-^*:-PO)exp(ui)|  ^  Kforallti  E. 

Proof.  See  [4],  Chapter  II,  Section  3.  □ 

Combining  the  last  two  lemmas  we  obtain  the  following  result. 
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Lemma  28.  Given  any  e  >  0,  any  K  >  \  and  any  integer  N  I  we  can  find  a 
trigonometric  polynomial  P  and  a  bijection  a  :  N  — Z  such  that 

(i)  IIPIU  <  1, 

(ii)  P(n)  —  a  for  all  N, 

(iii)  maxjt^o  I  j2j^k  P{<r(j))  exp{2m<r{j)t)\  ^  K  for  all  t  e  T. 

It  is  now  easy  to  prove  the  desired  result. 

Theorem  29.  We  can  find  a  continuous  function  /  :  T  C  with  f(n)  =  Ofor  n  <  0 
and  a  bijection  a  :  N  N  such  that 

sup  I Y]  f{(7{j))  exp(2mjt)\  =  c» 

for  all  i  G  T. 


If  instead  of  considering  complex  valued  functions  and  the  orthonormal  system  exp  27rmt, 
we  wish  to  consider  real  valued  functions  and  the  orthonormal  system  formed  by  sin  ‘I'Knt 
and  cos  27Ynt  then  we  can  replace  the  P  of  Lemma  28  by  +  SP  and  first  sum  the 
cos27ta{j)t  terms  and  then  the  sin 27r cr(j)i  terms.  The  existence  of  a  continuous  function 
whose  rearranged  Fourier  series  diverges  everywhere  was  first  proved  by  L.  V.  Taikov  [10] 
using  Olevskii’s  theorem  in  a  different  way. 

So  far  as  I  know  the  general  question  remains  open.  In  particular  we  may  ask  the  follow¬ 
ing  question. 


Question  30.  Consider  the  Haar  system  on  [0, 1)  ordered  in  some  way.  Does  there  exist 
a  continuous  function  /  :  [0, 1]  — C  and  a  bijection  a  :  N  —)•  N  such  that 


for  all  t  £  [0, 1),? 


sup 

k^O 


k 

j=0 


=  oo 


We  remark  that  a  similar  proof  to  the  one  above  (replacing  the  non-trivial  lemma  of  Ka- 
hane  and  Katznelson  by  a  trivial  parallel  lemma)  gives  the  following. 


Lemma  31.  Consider  the  Haar  system  on  [0, 1)  ordered  in  some  way.  There  exists  a  real 
/  G  and  a  bijection 

a:N^N 


such  that 


for  all  t  G  [0, 1). 


sup 


k 

/(x<7(j))X(70')  W 


=  CXD 
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The  method  of  proof  applies  to  any  ‘well  behaved’  system  (though  the  ‘unfair  example’ 
with  which  we  began  the  section  shows  that  it  can  not  apply  to  all  Riesz  bases). 


6.  Hard  summation 


The  reader  with  more  practical  interests  will  observe  that  striking  as  Kolmogorov’s  the¬ 
orem  and  its  generalisations  may  be,  they  do  not  answer  the  question  in  the  case  of  any  par¬ 
ticular  rearrangement.  Let  us  return  to  the  question  we  started  with  but  apply  it  to  a  general 
othonormal  system  (/>i,  (/12, . . . .  If  /  G  we  consider 

finUnii) 

where  /(n)  =  (/,  </>„)  is  the  usual  Fourier  coefficient  for  the  given  system.  This  method  of 
forming  a  sum  is  called  ‘hard  summation’  and,  as  we  said  at  the  beginning  of  the  talk,  is  a 
very  natural  one  to  use. 

(The  reader  may  well  ask  what  ‘soft  summation’  is.  Here  we  consider  something  like 

06  XI  /(«)'/'"(*)+  X 

\nn)\^6  6^\f(n)\^6/2  ' 

I  suspect  that  the  idea  of  soft  summation  derives  from  the  use  of  Ces^o  sums  and  similar 
filtering  techniques  to  improve  the  behaviour  of  Fourier  sums,  I  also  suspect  that  the  anal¬ 
ogy  is  false  since  I  know  of  no  reason  why  soft  summation  should  behave  better  than  hard 
summation.  However,  this  is  merely  opinion,  and  like  all  opinion  liable  to  be  disproved  by 
fact.) 

In  [5]  I  constructed  the  following  example. 

Theorem  32.  There  exists  a  f  £  Z/^(T)  such  that 


for  almost  all  i  €  T. 


lim  sup 

7?->0+ 


/ («)  exp  27rmt 


=  00. 


Later  in  [6]  I  showed  that  we  could  take  /  continuous. 

It  therefore  came  as  a  great  surprise  to  me  to  learn  that  Tao  had  proved  the  following 
theorem 


Theorem  33.  hard  summation  works  for  well  behaved  ( rapidly  decreasing)  wavelets. 

Still  more  surprising  was  the  simple  nature  of  his  proof.  In  order  to  show  how  it  works  I 
will  prove  it  in  a  special  case  where  the  argument  is  particularly  clean. 
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Theorem  34.  Consider  the  Haar  system  on  T.  If  J  :T  ^  Cis  continuous  then 

^  fix)x^f 

\f{x)\^n 

uniformly  on  T. 

[The  exceptionally  good  behaviour  of  the  Haar  system  means  that  just  as  in  some  sense 
it  is  the  hardest  to  prove  a  Kolmogorov  type  theorem  for,  so  it  is  in  some  sense  the  easiest  to 
prove  a  Tao  type  theorem  for.  However,  I  think  all  the  essential  ideas  can  be  seen  even  in  this 
simple  special  case.] 

The  reader  should  notice  that  we  now  use  a  different  normalisation  for  the  Haar  functions 
since  we  want  an  orthonormal  system  in  which  (x,  x)  —  I-  We  shall  return  to  this  point  when 
we  consider  Lemma  37. 

Tao’s  idea  is  to  use  one  of  the  key  characters  in  20th  century  harmonic  analysis  —  the 
maximal  function.  If  we  write 

PNf[t)  = 

rank(x)^N 

vw=  E 

\f{x)\^r} 

then  corresponding  maximal  functions  are 

P*f(t)  =  sup|Pw(/)(t)| 

57(t)  =  sup|5,(/)(t)|. 

T?>0 

Since  the  Haar  system  is  so  well  behaved  P*  is  easy  to  bound. 

Lemma  35.  |P7(i)l  <  ll/llco- 

Proof.  Observe  that  is  piecewise  constant  with  P^fit)  taking  the  average  value 

of  /  on  the  interval  of  the  form  {q  +  1)2“^)  to  which  t  belongs.  Thus  lPiv/(t)|  ^ 

||/||ooforallA^andso|P7(t)|  <  ||/||c«.  □ 

In  general  maximal  functions  are  not  so  easy  to  bound  and  the  main  work  of  the  proof  of 
Tao’s  result  lies  in  obtaining  a  bound  for  S*. 

Theorem  36.  There  exists  a  constant  K  such  that  if  f  :T  C  is  continuous  then 

\S*f(t)\  <  K\\f\U 

Proof  of  Theorem  34  from  Theorem  36.  Let  e  >  0.  Then,  since  P/v/(i)  ->  f 
uniformly  we  can  find  an  M  such  that  ||Pa//  —  /||oo  <  €.  Set  ^  =  /  —  PM(f)-  We  observe 
that  g{x)  —  0  if  rank  x  ^  and  Pm(/)(x)  =  0  if  rank  x  ^  +  L 
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Let  5o  =  1  if  PmU)  =  0  and  set 

(5o  =  min{|PM(/)(x)l ;  PuiDix)  +  0} 

Otherwise.  If  >  0  we  have 

and  so 

WSsif)  -  /Iloo  <  \\PM{f)  -  /Hoc  +  \\Ss{g)\\oo  ^  ^  (if  +  l)e 

using  Theorem  36.  Since  e  was  arbrary  the  result  follows.  □ 

In  the  proof  of  Theorem  36  we  look  out  for  ways  in  which  (well  behaved)  wavelets  differ 
from  their  Fourier  counterparts.  The  first  is  that  although  the  Riemann  Lebesgue  lemma 
represents  the  strongest  statement  we  can  make  about  the  decrease  of  Fourier  coefficients, 
much  stronger  results  hold  for  wavelets. 

Lemma  37.  Iff  :  [0, 1]  ^  C  is  continuous  and  x  has  rank  n  then 

l/(x)l  <  ll/IU2-"/^ 

Proof.  Direct  calculation  gives 

l/(x)l  =  1/  /(i)x(<)c*<  <  Il/Iloc||xl|oo|suppx| 

Msuppx 

=  =  l|/||oo2-"/^ 

Note  how  the  normalisation  of  x  comes  into  the  calculation.  □ 

The  second  and  best  known  difference  is  the  localisation  property  of  wavelets.  This  will 
play  an  important  role  in  our  estimate  of  the  maximal  function  S*f. 

Proof  of  Theorem  36.  .  It  is  suficicient  to  show  that  if  ||/l|oo  =  1»  and  1  ^  6  >  0 
then  |5^(/)(i)|  ^  K  for  some  fixed  K.  To  this  end,  choose  n  so  that  6  >  / 2 

and  observe  that  by  Lemma  37  this  means  that  |/(x)|  for  rankx  >  n. 
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Thus 

\Si{f)(t)\  =  p„(/)(t)  -  Y,  /WxW 

rankx^n,  l/(x)!<<^ 

<  \Pn{f){t)\  +  I  E  /WXWI 

rankx^n,  |/{x)l<^ 

^F*(/)W+5  IxWI 

rankx^n 

<ll/lloo  +  <5  Yl  IxWUW^o 
rank  x^n 

<  l  +  5^2’'/2 

0 

since  exactly  one  x  of  each  rank  is  non-zero  at  t  (this  is  an  extreme  version  of  the  localisation 
property  of  wavelets). 

Doing  some  simple  calculations  we  have 

”  ,  A2(«+1)/2  2^/2 

\smt)\  ^  1  +  <  1  +  2I7JTT  ^  1  +  217531’ 

0 

which  is  an  inequality  of  the  required  form  (with  K  =  1  +  2^/^/ (2^/^  -  1)).  □ 

To  extend  beyond  continuous  functions  we  need  to  use  the  Hardy-Littlewood  maximal 
operator 

M/(i)  =  sup—  /  \f{x)\dx. 

r>0  Jt-r 

If  we  do  so,  we  get  pointwise  convergence  almost  everywhere  for  /  G  i/  [1  ^  p  ^  oo]  for 
the  Haar  system. 

We  end  by  stating  Tao’s  general  theorem  from  [9].  He  works  on  R  which  is  a  more  useful 
space  than  T. 

Theorem  38  (Tao).  We  work  on  R  Suppose  cj)  has  integral  zero,  and  is  bounded  and 
rapidly  decreasing.  Suppose  that  the  set  of  functions  (wavelets)  given  by 

,j>,,,{x)  =  -  k)) 

form  an  orthonormal  system.  Then,  if  f  €  //(R)  [1  <  p  <  oo],  we  have 

Y2  f(<i>3,k)'i>i,k(^)  ->  f{x) 

\H<pj.k)\>k 

almost  everywhere  as  X  ^  0-1-. 
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Wavelet  expansions,  function  spaces  and 
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Abstract.  The  purpose  of  this  tutorial  is  to  describe  the  interplay  between  three 
subjects:  function  spaces,  wavelet  expansions,  and  multifractal  analysis.  Some  rela¬ 
tionships  are  now  classical.  Wavelet  bases  were  immediately  considered  as  remark¬ 
able  by  analysts  because  they  are  unconditional  bases  of  ‘most’  function  spaces.  This 
property  is  a  key  feature  of  the  denoising  algorithms  of  Donoho,  for  instance,  mul¬ 
tifractal  analysis  tries  to  derive  the  Hausdorff  dimensions  of  the  Holder  singularities. 

Wavelet  techniques  proved  the  most  efficient  tool  in  the  numerical  computation  of  the 
spectra  of  singularities  of  turbulent  flows. 

Our  purpose  is  first  to  present  these  points,  and  then  to  show  how  ideas  have 
developed  in  the  recent  interplay  between  these  three  fields. 

Refinements  of  the  numerical  techniques  introduced  to  compute  turbulence  spec¬ 
tra  have  led  to  the  introduction  of  new  function  spaces,  which  turn  out  to  be  the  right 
setting  to  determine  the  fractal  dimensions  of  graphs,  and  offer  natural  extensions  of 
the  Besov  spaces  to  negative  p’s. 

The  ‘function  space  setting’  allows  one  to  derive  Baire-type  results  for  the  value 
of  spectra. 

Keeping  the  histograms  of  wavelet  coefficients  gives  richer  information  than  just 
keeping  the  moments  of  these  histograms  (which  corresponds  to  keeping  only  the 
knowledge  of  the  function  spaces  to  which  the  fimction  belongs).  We  compare  the 
probabilistic  results  (obtained  from  histograms)  with  the  above  Baire-type  results. 

I.  Introduction 

We  will  describe  the  interplay  between  three  subjects:  function  spaces,  wavelet  expan¬ 
sions,  and  multifractal  analysis.  Some  relationships  are  now  well  established: 
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•  Wavelet  bases  were  immediately  considered  as  remarkable  because  they  are  uncondi¬ 
tional  bases  of  ‘most’  function  spaces.  This  property  has  very  practical  implications; 
for  instance,  it  is  a  key  feature  of  the  signal  denoising  algorithms  introduced  by  David 
Donoho  and  his  collaborators,  see  [11]. 

•  The  purpose  of  the  multifractal  formalism  introduced  by  Uriel  Frisch  and  Georgio 
Parisi  is  to  derive  the  Hausdorff  dimensions  of  the  Holder  singularities  of  a  function 
(the  so-called  ‘spectrum  of  singulgularities’)  from  the  knowledge  of  the  Besov  spaces 
to  which  this  function  belongs,  see  e.g.  [17]. 

•  Among  the  several  variants  of  the  multifractal  formalism  that  have  been  introduced, 
those  based  on  wavelet  techniques  proved  the  most  efficient  for  the  numerical  compu¬ 
tation  of  the  spectra  of  singularities  of  turbulent  flows,  see  [2]. 

We  will  present  these  topics,  and  show  how  ideas  which  developed  in  these  three  fields 
interplayed  recently;  we  will  particularly  focus  on  the  following  points: 

Refinements  of  the  numerical  techniques  introduced  to  compute  turbulence  spectra  have 
led  to  the  introduction  of  new  function  spaces,  which  recently  found  unexpected  applications: 
they  turn  out  to  be  the  right  setting  to  determine  the  fractal  dimensions  of  graphs,  and  they 
offer  natural  extensions  of  the  Besov  spaces  when  the  exponent  p  takes  negative  values. 

The  knowledge  of  the  Besov  spaces  to  which  a  collection  of  functions  belongs  allows  one 
to  derive  quasi-sure  results  (in  the  sense  of  Baire’s  categories)  for  the  value  of  the  spectra  of 
singularities  of  these  functions. 

In  image  or  signal  processing,  Besov  regularity  is  usually  a  by-product,  deduced  from 
the  knowledge  of  the  histogram  of  the  wavelet  coefficients  at  each  scale  j;  so  that  more 
information  is  actually  available.  We  will  determine  the  maximal  information  which  can  be 
derived  from  the  wavelet  histograms  and  is  independent  of  the  wavelet  basis  chosen.  Note 
that  working  on  histograms  of  wavelet  coefficients  is  not  new;  for  instance  cascade-type 
models  for  the  evolution  of  the  p.d.f.  (probability  density  function)  of  the  wavelet  coefficients 
through  the  scales  have  been  proposed  to  model  the  velocity  in  the  context  of  fully  developed 
turbulence,  see  [3].  We  will  compare  probabilistic  results  (obtained  from  drawing  at  random 
wavelet  coefficients  at  each  scale  inside  such  a  preassigned  sequence  of  histograms)  with  the 
Baire-type  results. 

2.  Wavelets  as  unconditional  bases  of  Besov  spaces 

The  most  important  basis  in  analysis  has  certainly  been  the  trigonometric  system.  This 
is  so  because  the  resolution  of  several  key  problems  in  physics  is  particularly  simple  when 
formulated  in  this  setting.  Unfortunately,  the  convergence  of  the  corresponding  series  posed 
important  mathematical  problems  since  Du  Bois-Reymond  showed  in  1873  that  the  Fourier 
series  of  a  continuous  function  may  diverge.  (See  [23]  where  the  fine  properties  of  Fourier  se¬ 
ries  and  the  development  of  ideas  that  led  to  wavelet  analysis  are  described.)  Is  this  phenom¬ 
enon  inherent  to  any  orthogonal  decomposition?  Hilbert  posed  this  problem  to  his  student 
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Alfred  Haar,  who  gave  a  negative  answer  in  his  thesis  by  constructing  in  1909  the  following 
orthonormal  basis  of  L^([0, 1]).  It  is  composed  of  the  function  1,  and  of  the  defined  by 

(2.1)  '>Pi,k{x)  =  2^/^ip(2’x  -  k) 

where  j  0,k  =  0, . .  =  1  [0,1/2]  - 1  [1/2,1],  and  l^i  denotes  the  characteristic  func¬ 

tion  of  the  set  A.  One  can  also  omit  the  fiinction  1,  but  use  all  positive  and  negative  integer 
values  of  j  and  k  and  thus  obtain  an  orthonormal  basis  of  (M) .  Haar  showed  that  the  partial 
sums  of  the  decomposition  of  a  continuous  function  in  this  basis  are  uniformly  convergent. 
The  comparison  with  the  trigonometric  system  is  striking:  A  basis  composed  of  discontin¬ 
uous  functions  is  more  adapted  to  the  analysis  and  reconstruction  of  continuous  functions 
than  the  trigonometric  system,  though  this  system  is  composed  of  C°°  functions.  The  Haar 
basis  has  another  important  property  which  the  trigonometric  system  lacks:  Marcinkiewicz 
showed  in  1937  that  it  is  an  unconditional  basis  of  the  spaces  Lf  when  1  <  p  <  00;  this 
means  that  any  function  of  If  can  be  written  in  only  one  way  as  Cj^k'^j^k  and  the  conver¬ 
gence  is  unconditional,  i.e.  does  not  depend  on  the  order  of  summation.  This  result  still  has 
important  implications  in  current  research.  In  1999,  Bourgain,  Brezis  and  Mironescu  used  the 
characterization  of  Lf  on  the  Haar  basis  as  a  key  tool  in  the  lifting  problem,  which  consists 
in  determining  if  any  function  /  which  belongs  to  a  given  function  space  and  satisfies  |/|  =  1 
can  be  written  f{x)  =  where  6  belongs  to  the  same  function  space.  (When  the  answer 
is  positive,  this  is  a  key -step  in  the  linearization  of  some  nonlinear  PDEs  where  the  constraint 
I/I  =  1  is  imposed  by  the  physics,  such  as  in  the  Ginzburg-Landau  model,  see  [8].) 

Of  course,  since  the  Haar  basis  is  not  composed  of  continuous  functions,  it  cannot  be  a  ba¬ 
sis  for  spaces  of  continuous  functions.  This  last  remark  motivated  researches  to  ‘smooth’  the 
Haar  basis.  The  goal  was  to  construct  bases  of  the  similar  algorithmic  type,  and  which  would 
be  unconditional  bases  of  as  many  function  spaces  as  possible.  In  1910,  Faber  considered  on 
[0, 1]  the  basis  composed  of  1,  rr  and  the  primitives  of  the  Haar  basis.  This  Schauder  basis 
(so-called  because  it  was  rediscovered  by  Schauder  in  1927)  has  the  same  algorithmic  form  as 
the  Haar  basis:  it  is  still  of  the  type  (2.1)  where  xp  is  the  primitive  of  the  Haar  wavelet.  Faber 
showed  that  this  system  is  a  basis  of  the  continuous  functions  on  [0, 1].  The  price  to  be  paid 
is  that  it  is  no  longer  a  basis  of  Should  one  necessarily  lose  on  one  hand  what  has  been 
obtained  by  the  other?  In  1928,  Franklin  showed  that  you  can  have  your  cake  and  eat  it  by 
applying  the  Gram-Schmidt  orthonormalization  procedure  to  the  Schauder  basis,  thus  obtain¬ 
ing  a  basis  which  is  simultaneously  unconditional  for  all  spaces  i/([0, 1])  (1  <  p  <  00),  for 
(7([0, 1])  and  for  the  Sobolev  spaces  of  low  regularity.  One  can  go  on  and  iterate  one  step  of 
integration  (which  regularizes)  and  one  step  of  Gram-Schmidt  orthonormalization;  Ciesielski 
thus  constructed  in  1972  bases  which  are  unconditional  for  a  wider  and  wider  range  of  func¬ 
tion  spaces  on  [0, 1].  Of  course,  applying  the  Gram-Schmidt  orthonormalization  procedure 
iteratively  makes  these  bases  essentially  impossible  to  compute  numerically:  Something  has 
been  lost  in  the  end:  algorithmic  simplicity.  It  is  therefore  no  wonder  that  the  Ciesielski  bases 
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were  never  used  in  practical  applications.  (Note  that  this  is  in  sharp  contrast  to  the  Haar  basis 
which,  despite  its  lack  of  regularity,  has  been  widely  used  in  image  processing.) 

However,  algorithmic  simplicity  and  regularity  can  go  together.  In  1981,  Stromberg  had 
the  idea  of  applying  the  Gram-Schmidt  orthonormalization  on  the  whole  line  instead  of  on 
[0, 1]  (loosely  speaking,  one  starts  the  orthonormalization  at  -oo).  Because  of  the  dilation 
and  translation  invariance  of  the  real  line,  this  substitute  of  the  Shauder  basis  now  has  the 
exact  algorithmic  form  (2.1).  Starting  the  orthonormalization  with  B-splines  of  arbitrary  high 
degree,  Stromberg  thus  constructed  orthonormal  wavelet  bases  of  arbitrarily  large  regularity. 
These  bases  are  unconditional  for  a  wider  and  wider  range  of  Sobolev  or  Besov  spaces.  The 
ultimate  perfection  was  found  by  Yves  Meyer  and  Pierre-Gilles  Lemari^  who  constructed,  in 
1986,  wavelets  such  that  the  functions 

(2.2)  -k),  j  e  Z,  fc  6  Z'' 

form  an  orthonormal  basis  of  this  basis  allows  one  to  characterize  functions  of 

arbitrary  regularity  (or  distributions  of  arbitrary  irregularity  by  duality),  see  [27].  In  order  to 
be  more  specific,  we  start  by  introducing  some  notation. 

Wavelets  and  wavelet  coefficients  will  be  indexed  by  dyadic  cubes;  A  will  denote  the  cube 
Ajyfc  =  k2~^  +  [0,2“^]^^,  ipx  will  denote  the  wavelet  —  A:)  (note  that  we  ‘forget’  to 

write  the  index  i  of  the  wavelet,  which  is  of  no  consequence).  Thus 

(2.3)  /(i)  =  ^C;^V’aW, 

A 

where  the  wavelet  coefficients  of  /  are  given  by 

(Note  that  we  do  not  use  the  usual  normalization;  the  natural  normalization  for  the  prob¬ 
lems  we  will  consider  is  the  normalization.) 

Letp  €  (1,  +oo)  and  s  ^  0;  by  definition,  the  Sobolev  space  is  composed  of  the 

functions  of  whose  fractional  derivatives  of  order  s  also  belong  Lp(IR‘^).  A  function 

/  belongs  to  i/’^(IR^)  if  and  only  if  its  wavelet  coefficients  satisfy  the  following  condition, 
see  [29] 


1/2 


(2.4) 


A,z 


(Note  the  sharp  contrast  with  Fourier  series:  When  p  2,  there  exists  no  characterization  of 
by  conditions  on  the  moduli  of  the  Fourier  coefficients.) 

These  characterizations  are  quite  difficult  to  handle  and,  in  the  context  of  wavelet  analysis, 
Besov  spaces  are  preferred  for  the  two  following  reasons: 
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•  They  are  very  close  to  the  Sobolev  spaces,  as  shown  by  the  following  embeddings 


y,s—€ 


Ve>0,  Vp^l,  Vq,  ^  ^ 

•  They  have  a  very  simple  wavelet  characterization,  see  [7],  [26]  and  [29], 


i/p 


(2.5)  /  e 


k  i,k  i 


=  with  {€j)j^z  e  I”. 


Note  that  in  all  such  characterizations,  wavelets  are  assumed  to  be  smooth  enough,  say,  with 
at  least  derivatives  up  to  order  [s]  + 1  having  fast  decay  (see  [7]  for  optimal  regularity  assump¬ 
tions  on  the  wavelets).  In  sharp  contrast  with  the  Sobolev  case,  Besov  spaces  are  defined  for 
any  p  >  0. 

The  global  regularity  information  about  a  function  /  is  given  by  its  Besov  domain  Bf, 
which  is  the  set  of  {q,  s)  such  that  /  belongs  to  By  interpolation,  the  Besov  domain 

has  to  be  a  convex  subset  of  IR^,  and  the  Besov  embeddings  imply  that,  if  (g,  s)  belongs  to 
Bj,  then  the  segment  joining  (g,  s)  and  (0,  s  -  dq)  also  belongs  to  Bf,  see  [31].  It  follows 
that  the  boundary  of  the  Besov  domain  is  the  graph  of  a  function  5(g)  which  is  concave  and 
satisfies 


(2.6)  0  ^  s'{q)  ^  d. 

The  following  proposition  of  [19]  shows  that  (2.6)  characterizes  the  possible  functions  s(g). 

Proposition  1.  Any  concave  function  that  s(g)  satisfies  (2.6)  defines  the  boundary  of 
the  Besov  domain  of  a  distribution  f. 

One  of  the  reasons  for  the  success  of  wavelet  decompositions  in  applications  is  that  they 
often  lead  to  very  sparse  representations  of  signals.  This  sparsity  can  be  characterized  by 
determining  to  which  Besov  spaces  Bp^  the  function  considered  belongs  when  p  is  close  to 
0.  Let  us  illustrate  this  assertion  by  an  example.  Consider  the  function 

H{x)  =  1  if  la;|  ^  1, 

=  0  elsewhere, 

and  suppose  that  the  wavelet  used  is  compactly  supported,  say  on  [-A,  A].  For  each  j,  there 
are  less  than  4A  non- vanishing  wavelet  coefficients,  so  that  the  wavelet  expansion  of  /  is 
extremely  sparce.  Since  H{x)  is  bounded,  |c;^|  ^  C  VA.  Using  (2.5),  it  follows  that  H{x) 
belongs  to  as  soon  as  s  <  1/p.  Let  us  check  that,  conversely,  this  property  is  a  way 

to  express  that  the  wavelet  expansion  of  /  is  sparce.  We  suppose  that  a  bounded  function  / 
satisfies 
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We  claim  that  VA  >  0,  Ve  >  0,  at  each  scale  j  there  are  less  than  C(e,  A)2^^  coefficients  of 
size  larger  than  Indeed,  if  it  were  not  the  case,  taking  p  =  e/(2^),  we  get  |ca|^  — ^ 
+00  when  j  — )•  +oo,  hence  a  contradiction. 

Here  is  another  illustration  of  the  relationship  between  sparcity  of  the  wavelet  expansion 
and  Besov  regularity.  Suppose  that  /  belongs  to 

Pi  . 

p>0 

Going  back  to  (2.5),  this  condition  exactly  means  that  the  sequence  belongs  to  for  all 
P>  0,  which  is  also  equivalent  to  the  fact  that  the  decreasing  rearrangement  of  the  sequence 
\c^\  has  fast  decay,  which,  again,  is  a  way  to  express  sparcity,  see  [21]. 

Besov  spaces  when  p  <  1  are  no  longer  locally  convex,  which  partly  explains  the  diffi¬ 
culties  met  when  using  them.  Before  the  introduction  of  wavelets,  these  spaces  were  either 
characterized  by  the  order  of  approximation  of  /  by  rational  functions  whose  numerator  and 
denominator  have  a  given  degree,  or  equivalently  by  the  order  of  approximation  by  splines 
with  ‘free  nodes’  (which  means  that  the  points  where  the  piecewise  polynomials  are  con¬ 
nected  are  left  free,  and  can  thus  be  fitted  to  the  function  considered),  see  [10]  and  [21]. 
However  such  characterizations  were  much  more  difficult  to  handle,  and  of  hardly  any  use  in 
numerical  applications. 

3.  Beyond  Besov  spaces:  oscillation  spaces 

One  important  drawback  when  using  Besov  spaces  is  that  any  information  concerning 
possible  correlations  on  the  position  of  large  wavelet  coefficients  is  lost,  since  the  wavelet 
norm  (2.5)  is  clearly  invariant  under  permutations  of  the  wavelet  coefficients  at  the  same 
scale;  this  can  be  a  drawback.  For  instance,  piecewise  smooth  functions  clearly  have  their 
large  wavelet  coefficients  located  at  the  singularities,  so  that  such  functions  exhibit  very 
strong  correlation  between  the  positions  of  large  wavelet  coefficients.  Let  us  show  another 
occurrence  of  this  problem,  concerning  the  computation  of  the  fractal  dimensions  of  graphs. 

Definition  1.  Let  K  be  a  bounded  subset  N{K,j)  will  denote  the  number 

of  dyadic  cubes  of  size  2~^  necessary  to  cover  K.  The  fractal  dimension  of  K  (also  called 
upper  box  dimension)  is 

j->C50  3  tOS  2 

If  K  is  the  graph  of  a  compactly  supported  continuous  function,  N{K,  j)  is  related  to  the 
oscillation  of  /. 

Definition  2.  Let  \  be  a  dyadic  cube  included  in  iff  is  a  continuous,  real  valued 
function  defined  on  IR^,  let 

(3.1)  osc(/,  A)  =  sup  f{x)  -  inf  f{x) 

xeA 
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denote  the  oscillation  off  on  the  cube  A.  The  p-oscillation  off  at  scale  j  is  defined  by 

OsCpifJ)  =  ^2  {osc{f,Xj,k)Y 

k 


where  Xj^k  ^  the  dyadic  cube  k2  ^  +  2  ^[0, 1]^^  (when  p  =  I  one  uses  the  term  'oscillation’ 
instead  of  ‘1-oscillation’ f  The  p-oscillation  exponent  is 


uJf)  ~  liminf 
^■->•00 


log  jOsCpifJ)) 
\og{2-j) 


(The  p-oscillation  is  a  variant  of  the  p-variation,  see  [18].)  It  follows  easily  that  there  exist 
two  positive  constants  C  and  C'  such  that 

(3.2)  C  {2<^^  +  2^0sc^(fJ))  N(Graph{f)J)  <  C'  (2^^  +  2^0sc^{fJ)) . 

Let  us  explain  by  an  example  why  the  box  dimension  of  a  graph  cannot  be  deduced  from 
the  Besov  domain  of  a  function;  we  use  essentially  a  construction  due  to  Anna  Kamont  and 
Barbara  Wolnick,  cf  [25]. 

Consider  the  following  function:  Let  ji  »  jo  >>  0;  all  wavelet  coefficients  of  /, 
defined  on  [0, 1],  vanish  for  j  ^  ju  at  the  scale  ji,  there  are  nonvanishing  wavelet 
coefficients  of  size  Let  us  now  consider  the  two  extreme  possibilities: 

•  All  nonvanishing  wavelet  coefficients  are  packed  in  the  first  locations  {k  = 
1, . . . ,  2^1  “-^0).  In  this  case,  the  oscillation  at  the  scale  2~^°  vanishes,  except  for  the  two 
first  dyadic  intervals,  for  which  it  is  ^  2““^^ . 

•  If  the  nonvanishing  wavelet  coefficients  are  equidistributed,  the  oscillation  at  the  scale 
2  “JO  is  ~  2““Ji  on  each  dyadic  interval  of  length  2"^^. 

The  total  oscillation  at  scale  2“J°  is  thus  ^  2““J^  in  the  first  case,  and  ^  in  the 

second.  We  can  pick  jo  and  ji  such  that  the  oscillations  in  both  cases  are  not  of  the  same 
order  of  magnitude.  By  piling  up  this  construction  on  an  infinite  number  of  scales,  it  is  easy 
to  construct  two  functions  with  the  same  histograms  of  wavelet  coefficients  at  each  scale,  and 
different  box  dimensions  of  graphs.  The  problem  here  is  that  the  box  dimension  of  graphs  is 
clearly  altered  by  clustering  or  spreading  the  large  wavelet  coefficients.  Therefore,  it  can  be 
measured  only  by  norms  that  are  able  to  take  such  phenomena  into  account.  It  is  the  purpose 
of  the  oscillation  spaces  which  allow  one  to  derive  p-oscillation  exponents  from  the  wavelet 
coefficients.  In  the  following.  A'  denotes  the  dyadic  cube  k'2~^'  +  [0, 2~J^. 


Dehnition  3.  Let  p  >  0,  and  s,  s’  £  IR;  then  a  function  f  belongs  to  (!)^’^'(]R^)  if  its 
wavelet  coefficients  satisfy 


The  left  hand-side  defines  the  {fSl^)-seminorm, 


ifp 


|cv2 


s  f  IP 


<  oo. 
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Proposition  2  will  imply  that  this  definition  is  independent  of  the  wavelet  basis  chosen. 
Note  that  this  definition  exhibits  the  property  we  were  looking  for,  and  that  Besov  spaces 
were  lacking.  Namely,  because  of  the  that  appears  in  the  definition,  two  functions 

that  share  the  same  histograms  of  wavelet  coefficients  at  all  scales  may  have  very  different 
norms,  depending  upon  whether  the  large  wavelet  coefficients  are  more  or  less  clustered, 
oscillation  spaces  take  into  account  the  geometric  disposition  of  the  wavelet  coefficients. 


Theorem  1.  If  f  belongs  to  C'^(IR‘^)  for  an  e  >  0,  the  p~oscillation  exponent  of  f  is 
given  by 


log  E  sup  |ca- 


U}p(f)  =  sup{s  :  /  G  =  limsup 

j^+oo 


AeAj 


A'CA 


jlog2 


The  spaces  are  defined  by  conditions  on  the  wavelet  coefficients,  therefore  one  first 
has  to  check  that  their  definition  is  intrinseque,  i.e.,  independent  of  the  wavelet  basis  chosen. 
One  way  to  do  it,  following  [29],  is  to  check  that  condition  (3.3)  is  invariant  under  the  action 
of  the  “infinite  matrices”  which  belong  to  the  algebras  for  7  large  enough;  these  algebras 
are  defined  as  follows,  see  [29]:  A(X^  X')  (indexed  by  the  dyadic  cubes)  belongs  to  if 

^  (1  +  O'  - 

Matrices  of  operators  which  map  a  wavelet  basis  onto  another  belong  to  these  algebras,  and 
more  generally  matrices  (on  wavelet  bases)  of  pseudodifferential  operators  of  order  0,  such  as 
the  Hilbert  transform  in  dimension  1,  or  the  Riesz  transforms  in  higher  dimensions,  see  [29]. 
We  denote  by  Op{M'^)  the  space  of  operators  whose  matrix  on  a  wavelet  basis  belongs  to 
M'^.  The  following  proposition  is  proved  in  [18]. 

Proposition  2.  Ifj  ^  sup(|s|,  |5^|),  then  the  operators  which  belong  to  Op{M^)  are 
continuous  on  0^’^' 

The  following  corollary  is  an  immediate  consequence  of  (3.2)  and  Theorem  1. 

Corollary  1.  Let  /  :  ]R  be  a  compactly  supported  function  which  belongs  to 

C^{]R^)foran  e  >  0.  Then 

dimb(Graph(f))  —  sup  (d,  1  —  sup{s  ;  /  e  OJ'"}) . 

Similarly,  Kamont  recently  proved  the  following  wavelet  characterization  of  the  lower 
box  dimension  of  a  graph  in  terms  of  the  wavelet  expansion  of  the  function,  see  [24].  The 
lower  box  dimension  of  a  set  K  is  by  definition 

-  j-^oo  jlog2 
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Theorem  2.  Let  f  :  ]R^  IR  be  a  compactly  supported  function  which  belongs  to 
C^{JR^)for  an  e  >  0;  let 

'Ej  =  y^  sup  and  1}  =  sup  ( 2~-^'(^~^)  \cx'\ 

then  ^ 

dimi,{Graph{f))  =  1  +  lim inf 

-  j-^oo  jlog2 

Another  (much  more  difficult)  problem  concerning  the  fractal  nature  of  graphs  is  to  de¬ 
termine  their  Hausdorff  dimension.  Let  us  recall  the  definition  of  the  Hausdorff  dimension  of 
a  subset  A  C  IR^.  For  £:  >  0,  let 


t 


where  i?  is  a  generic  covering  of  the  set  A  by  balls  Bi  of  diameter  Si  ^  e,  Then 
dimH(A)  —  sup{d  :  limM^'^  —  +00}  =  mf{d  :  =  0}. 

Extending  the  previously  mentioned  techniques,  Francois  Roueff  proved  that,  if  /  € 
for  an  e  >  0,  the  Hausdorff  dimenson  of  the  graph  of  /  is  bounded  by  d  -I- 1  -  sup{s  ;  /  G 
(actually,  some  sharper  estimates  can  be  found  in  [32]). 

The  wavelet  characterization  given  by  Corollary  1  has  implications  in  rugosimetry.  In¬ 
deed  the  fractal  dimension  of  a  surface  has  been  shown  to  be  a  pertinent  way  to  model  the 
notion  of  rugosity,  see  [12].  (In  [33]  other  generalizations  of  Sobolev  spaces  are  introduced 
following  this  motivation.)  Therefore,  finding  a  numerically  stable  algorithm  to  measure  this 
fractal  dimension  became  an  important  issue.  Numerical  algorithms  based  on  the  oscillation 
are  discussed  in  [12];  alternative  algorithms  based  on  the  wavelet  characterization  of  Corol¬ 
lary  1  should  prove  numerically  more  stable,  since  they  wouldn’t  be  based  directly  on  the 
pointwise  value  of  the  function,  but  on  wavelet  coefficients,  which  are  averaged  quantities, 
and,  as  such,  are  less  sensitive  to  noise. 

Let  us  end  this  section  by  mentioning  a  remarkable  property  of  oscillation  spaces.  In 
the  multifractal  formalism  that  we  will  present  in  the  next  section,  the  spectrum  of  singul- 
gularities  of  a  function  /  is  deduced  from  its  Besov  domain  by  a  Legendre  transform.  One 
drawback  of  this  approach  is  that  Besov  spaces  are  defined  only  for  positive  p’s.  Thus,  at  best, 
this  method  allows  one  to  recover  the  increasing  part  of  the  concave  hull  of  the  spectrum;  ob¬ 
taining  the  decreasing  part  would  involve  the  extension  of  the  Besov  domain  for  negative  p’s, 
which  is  clearly  absurd  when  starting  from  any  of  the  usual  definitions  of  Besov  spaces.  On 
the  contrary,  oscillation  spaces  have  a  natural  extension  to  negative  p’s.  Let  us  sketch  how 
this  extension  can  be  derived.  First,  we  remark  that  Definition  3  can  be  rewritten 


(3.4) 


VjGZ 
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Note  that,  in  particular,  applying  this  condition  for  j  —  0,  we  obtain  that,  for  any  f  ^  0, 
|ca'|  <  or,  in  other  words,  /  €  Conversely,  the  condition  /  €  is 

necessary  to  make  sure  that  the  suprema  in  (3.4)  are  finite.  Therefore,  we  adopt  the  following 

Definition  4.  Let  p  <  0,  and  s,s'  e  IR;  then  a  function  f  belongs  to  (IR'^)  if  f 
belongs  to  C^'(IR^)  and  if  its  wavelet  coefficients  satisfy 

VjGZ  VfsupM^'^'lV 

k  / 

The  remarkable  property  of  this  definition  is  that  it  is  ‘almost’  independent  of  the  wavelet 
basis  (in  the  Schwartz  class)  which  is  chosen.  More  precisely,  the  spaces  ns">s  Op’^'  are 
‘intrinseque’,  i.e.,  are  independent  of  the  wavelet  basis,  and  more  generally  are  invariant 
under  the  action  of  an  operator  whose  matrix  A  satisfies 

and  e  f] M’’. 

7>0  7>0 

4.  Multifractal  analysis  and  the  Frisch-Parisi  formula 

Large  classes  of  signals  exhibit  a  very  irregular  behavior.  In  the  wildest  situations,  this 
irregularity  may  follow  different  regimes,  and  can  switch  from  one  regime  to  another  almost 
instantaneously.  This  is  the  case  for  recordings  of  speech  signals;  precise  recordings  of  tur¬ 
bulence  data  (which  became  available  at  the  beginning  of  the  80s)  showed  that  turbulence 
also  falls  in  this  category.  Such  signals  cannot  be  modeled  by  standard  stationary  increments 
processes,  such  as  fractional  brownian  motions.  The  techniques  of  multifractal  signal  analy¬ 
sis  have  been  specifically  designed  to  analyze  such  behavior.  Initially  developed  in  the  mid 
80’s  in  the  context  of  turbulence  analysis,  they  were  applied  successfully  to  a  large  range  of 
signals,  including  traffic  data  (cars  and  internet),  stock  market  prices,  speech  signals,  texture 
analysis,  DNA  sequences... (see  [1]  and  [28]  for  instance). 

We  start  by  introducing  the  definitions  related  to  pointwise  regularity.  Let  o;  be  a  positive 
real  number  and  e  IR”^;  a  function  /  :  ->  IR  is  C^{xo)  if  there  exists  a  polynomial  P 

of  degree  less  than  a  such  that 

(4.1)  \f{x)  -  P(x  -  a:o)|  <  C\x  -  XoT- 

The  Holder  exponent  h f(xo)  is  the  supremum  of  all  the  values  of  a  such  that  (4.1)  holds. 
We  are  interested  in  analyzing  signals  whose  Holder  exponent  may  widely  change  from  point 
to  point.  This  instability  usually  makes  the  task  of  determining  the  Holder  exponent  hf(x) 
very  difficult  numerically.  This  is  the  case  for  multifractal  functions,  where  the  Holder  expo¬ 
nent  jumps  from  point  to  point.  In  that  case,  points  with  a  given  Holder  exponent  form  fractal 
sets,  and  one  is  not  interested  in  determining  the  exact  value  of  the  Holder  exponent  at  every 
point  but  rather  in  extracting  some  relevant  information  concerning  the  size  and  geometry  of 
the  Holder  singularities.  The  relevant  quantity  is  the  spectrum  of  singulgularities. 
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Definition  5.  Let  Ah  be  the  set  of  the  points  x  where  hf{x)  —  H.  The  domain  of 
definition  of  the  spectrum  of  singulgularities  d{H)  is  the  set  of  H's  such  that  Ah  is  not 
empty.  If  this  is  the  case,  d{H)  is  the  Hausdorff  dimension  of  Ah >  Otherwise,  if  H  is  not  a 
value  taken  by  hf,  d{H)  —  — oo. 

It  is  clearly  impossible  to  estimate  numerically  the  spectrum  of  singulgularities  of  a  signal 
since  it  involves  the  successive  determination  of  severaJ  intricate  limits,  and  a  blind  applica¬ 
tion  of  the  formula  giving  the  definition  of  the  Hausdorff  dimension  would  yield  enormous, 
totally  unstable  calculations.  The  only  method  is  to  find  some  ‘reasonable’  assumptions  under 
which  the  spectrum  could  be  derived  using  only  averaged  quantities  (which  should  be  numer¬ 
ically  stable)  extracted  from  the  signal.  Such  formulas,  called  multi  fractal  formalisms,  were 
inferred  first  by  physicists. 

The  initial  formulation  asserts  that  the  spectrum  of  Holder  singularities  of  a  function  can 
be  recovered  from  the  scaling  function  7?/(p),  defined  for  p  >  Oby 


rjf{p)  —  sup{s  :  /  G  =  d  -f  liminf 

y  ji— >+oo 


It  follows  immediately  from  the  definition  of  s{q)  that 
(4.2)  r)f(p)=ps(l/p). 


cj"  /(log  2-0- 


The  multifractal  formalism  may  be  surprising  at  first  sight  because  it  relates  pointwise  be¬ 
havior  (Holder  exponents)  to  global  estimates  (Besov  regularity).  Before  studying  its  mathe¬ 
matical  validity,  it  may  be  enlightening  to  give  the  heuristic  argument  from  which  it  is  derived. 
Though  this  argument  cannot  be  transformed  into  a  correct  mathematical  proof,  it  shows  at 
least  why  these  formulas  can  be  expected  to  hold,  and  a  careful  study  of  its  implicit  asump- 
tions  shows  its  limitations.  This  argument  can  be  decomposed  into  four  steps,  each  involving 
specific  asumptions  that  we  will  state  explicitely  in  order  to  make  clear  the  conditions  under 
which  the  formalism  can  be  expected  to  hold.  It  is  initially  based  on  the  following  character¬ 
ization  of  the  Holder  exponent  based  on  decay  estimates  of  the  wavelet  coefficients,  see  [14] 
(or  [20]  for  the  sharpest  results). 


Proposition  3.  If  f  belongs  to  0^(11^^)  for  an  e  >  0(i.e.  if\cx\  ^  C2  the  Holder 
exponent  of  f  at  each  point  xq  is  given  by 

(4.3)  b,f(xQ)  —  liminf  inf  - —  - rr- 

^  ^  ^  j-^oo  k  log(2-J -+■  |A;2-J -a^ol) 


Step  1:  The  first  asumption  in  the  derivation  of  the  multifractal  formalism  is  that  the 
Holder  exponent  of  /  at  every  point  xq  is  given  by  the  rate  of  decay  of  the  wavelet  coefficients 
of  /  in  a  cone  1^2”^  —  a;o|  ^  C2~f  Coming  back  to  (4.3),  if  these  coefficients  decay  like 
2~^\  we  expect  that  hf{xo)  =  H.  This  statement  is  wrong  in  full  generality  but  is  true  under 
the  hypothesis  that  /  has  only  cusp-typc  singularities  (see  [30]). 
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Step  2:  We  estimate,  for  each  H,  the  contribution  of  the  Holder  singularities  of  exponent 
H  to  the  quantity 

(4.4) 

A: 

Each  such  singularity  brings  a  contribution  of  We  need  about  cubes  of  width 

to  cover  these  singularities;  the  total  contribution  of  the  Holder  singularities  of  exponent 
H  to  (4.4)  is  thus 

This  is  clearly  a  critical  step  in  the  argument;  it  involves  an  inversion  of  limits  which  supposes 
that  all  Holder  singularities  start  to  have  coefficients  ^  2"^^  at  a  certain  scale  J,  and  that  the 
Hausdorff  dimension  is  estimated  as  a  box  dimension.  It  is  remarkable  that  the  multifractal 
formalism  is  valid  in  many  situations  where  these  two  hypotheses  do  not  hold. 

Step  3:  This  consists  of  a  steepest  descent  argument.  When  j  ->  -Foo,  among  the  terms 

(4.5) ,  the  one  which  yields  the  main  contribution  to  (4.4)  is  obtained  for  the  exponent  H 
realizing  the  infimum  of  Hp  —  d{H);  hence 

v{p)  -  d  =  inf  {Hp  -  d(H)) . 

H 

Step  4:  If  d{H)  is  a  concave  function,  —d{H)  and  —T]{p)  +  d  are  convex  conjugates,  and 
each  can  be  recovered  from  the  other  by  a  Legendre  transform;  it  follows  that 

(4.6)  d{H)  =  inf  {Hp  -  rj{p)  +  d) . 

p>0 

The  hypothesis  that  d{H)  is  a  concave  function  is  often  wrong;  there  are  three  ways  to  counter 
this  difficulty: 

•  Stop  at  Step  3,  and  check  that  rj(p)  is  the  Legendre  transform  of  d{H)-,  however  this 
weak  form  of  the  multifractal  formalism  is  of  little  interest  since  d{H)  is  the  mathe¬ 
matical  object  of  interest,  and  rj{p)  is  the  only  computable  quantity  in  practice. 

•  Assert  that  we  thus  obtain  only  the  convex  hull  of  the  spectrum.  This  is  fine  when 
the  function  obtained  is  strictly  concave,  but  it  yields  ambiguous  information  when 
it  contains  straight  segments,  which  is  often  the  case.  Do  these  segments  correspond 
to  effective  points  of  the  spectrum,  or  are  they  just  the  convex  hull  in  a  non-concave 
region? 

•  Do  not  use  the  partition  function  (4.4,  but  instead  deal  directly  with  histograms  of 
wavelet  coefficients;  we  will  discuss  this  approach  in  the  next  section. 

There  exist  several  mathematical  examples  where  the  Holder  exponent  can  be  analyti¬ 
cally  determined,  and  the  validity  of  the  multifractal  formalism  has  been  successfully  tested 
(including  selfsimilar  functions  (see  [17]  and  [6]),  specific  historical  functions,  (see  [16],  and 
references  therein)  and  Levy  processes,  see  [15]).  These  examples  give  some  insight  about 
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sufficient  conditions  for  the  validity  of  the  multifractal  formalism.  In  each  case,  the  function 
(or  its  wavelet  transform)  exhibits  some  selfsimilarity  (deterministic  or  statistical). 

5.  Multifractal  formalism:  Mathematical  results 

To  simplify  some  arguments,  we  suppose  from  now  on  that  the  functions  we  consider  are 
defined  on  IR,  are  1 -periodic,  belong  to  C^(IR),  and  that  we  use  one-dimensional  periodized 
wavelets. 

5.1.  Upper  bounds  for  spectra 

We  start  by  describing  bounds  on  spectra  of  singularities  which  hold  in  full  generality. 
These  bounds  are  based  on  histograms  of  wavelet  coefficients,  so  that  we  start  by  defining 
relevant  quantities  derived  from  these  histograms.  For  each  let 

(5.1)  iV,(a)  =  #{lQfc|>2-“^}. 

If  p(q;,  e)  =  lim  sup  log2(A^j(Q:  +  e)  —  Nj(a  —  e)),  we  note  p{a)  =  inf  /o(a,  e).  (There 

j->-oo 

are  about  coefficients  of  size  of  order  2““^  .)  The  scaling  function  77/ (p)  can  be  derived 
from  the  wavelet  histograms  by 

(5.2)  77/(p)  =  liminf  ( -^log2(2“^  f  Nj{a)da) 

j^+00  \  j  J 

Indeed,  by  definition  of  Nj,  XIa:  The  following  result  of  [19]  gives 

the  relationship  between  the  “Besov  approach”  and  the  “wavelet  histograms  approach”.  It  is 
a  direct  consequence  of  (5.2)  and  shows  that  77/ (p)  can  be  deduced  from  p(q;)  by  a  Legendre 
transform.  Note  that  p(a)  cannot  be  recovered  from  77/  (p) ;  only  its  convex  hull  can.  Therefore 
p{a)  contains  more  information  on  /  than  77/ (p). 

Proposition  4.  For  any  function  f, 

(5.3)  rifip)  =  inf  {ap  -  p{a)  +  1) . 

The  following  proposition  gives  the  optimal  upper  bound  on  d{H)  that  can  be  derived 
from  the  wavelet  histograms. 

Proposition  5.  If  f  £C^{)R)  for  an  e  >  0,  then 

(5.4)  d(H)  <  H  sup 

a€[0,J?]  « 

If  /  G  C'^(IR)  for  an  e  >  0,  there  exists  a  unique  critical  exponent  Pc  such  that  ?7/(pc)  =  1; 

(5.4)  easily  implies  the  classical  bound 

(5.5)  d{H)  <  inf  (pH  -  r]f(p)  +  1), 
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of  [17].  Nonetheless  (5.4)  clearly  yields  a  sharper  estimate  if  p{a)  is  not  concave.  Thus, 
strictly  more  information  can  be  deduced  from  the  histograms  of  wavelet  coefficients  than 
from  the  scaling  function.  Note  that,  though  (5.4)  can  be  sharper  than  (5.5),  nonetheless  (5.5) 
is  optimal  in  the  sense  that  no  better  upper  bound  can  be  deduced  in  full  generality,  as  we  will 
see. 

The  optimal  bounds  (5.4)  and  (5.5)  allow  one  to  propose  alternative  formulas  for  the 
multifractal  formalism.  The  almost-sure  multifractal  formalism  holds  if  (5.4)  is  saturated, 
i.e.  if 

(5.6)  d{H)  =  H  sup 

aG[o,/r]  ca 

and  the  quasi-sure  multifractal  formalism  holds  if  (5.4)  is  saturated,  i.e.  if 

(5.7)  d(H)  =  inf  (pH  -  7]f(p)  +  1). 

P^Pc 

We  will  see  in  the  next  subsections  general  results  concerning  the  validity  of  these  formulas. 

We  conclude  this  section  with  a  general  remark  on  histograms  of  wavelet  coefficients. 
The  precise  values  taken  by  the  sequence  Nj(a)  clearly  depend  on  the  wavelet  basis  cho¬ 
sen.  One  may  wonder  what  is  the  maximal  information  that  can  be  extracted  from  wavelet 
histograms,  and  doesn’t  depend  on  the  particular  wavelet  basis  chosen;  or,  more  generally, 
what  is  the  maximal  information  that  can  be  extracted  from  wavelet  histograms,  and  is  in- 
trinseque,  meaning  here  that  it  remains  unchanged  when  applying  to  the  sequence  of  wavelet 
coefficients  an  operator  A  such  that 

(5.8)  Aef\M'<  and  /t"'  e  f)  M\ 

7>0  7>0 

One  can  check  that  p{oi)  is  not  intrinseque,  but  that 

A(q!)  =  lim  I  sup  p{a')  | 

\a'ei-oo,a+e]  J 

is  intrinseque,  and  that  it  is  ‘maximal’,  in  the  sense  that  any  intrinseque  quantity  deduced 
from  p{a)  can  be  deduced  from  A  (a). 

5.2.  Quasi-sure  results 

This  section  describes  results  from  [19].  Formula  (5.7)  certainly  does  not  hold  in  full 
generality,  and  it  is  extremely  easy  to  construct  counterexamples.  On  the  other  hand,  each 
time  it  has  been  shown  to  hold,  it  was  the  consequence  of  a  functional  equation  satisfied 
by  the  function  under  study  (usually  a  selfaffinity  property,  either  exact,  approximate,  or 
stochastic).  Therefore,  the  general  consensus  among  mathematicians  and  physicists  was  that 
the  validity  of  the  multifractal  formalism  must  be  a  consequence  of  the  precise  inner  structure 
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of  the  function  considered.  Actually,  the  opposite  is  true;  (5.7)  holds  for  quasi-all  functions, 
i.e.,  outside  a  set  of  the  first  class  of  Baire.  Let  us  explain  more  precisely  what  we  mean. 

The  multifractal  formalism,  reformulated  as  above,  states  that  if  /  belongs  to  the  topo¬ 
logical  vector  space 

(5.9)  V'=  fi 

e>0,p>0 

then  its  spectrum  of  singulgularities  satisfies  (5.7). 

The  space  V  (=  Vj^)  is  a  Baire’s  space,  i.e.,  any  countable  intersection  of  everywhere 
dense  open  sets  is  everywhere  dense;  we  will  see  that,  in  V,  the  set  of  functions  that  satisfy 
(5.7)  contains  a  countable  intersection  of  everywhere  dense  open  sets  of  V,  i.e.,  contains  a 
dense  Gs  set  (we  say  that  quasi-all  functions  of  V  satisfy  (5.7)).  In  order  to  precisely  state 
our  result,  we  first  have  to  determine  what  are  the  conditions  satisfied  by  a  function  T]{p)  so 
that  it  is  a  scaling  function.  The  following  definition  follows  directly  from  Proposition  1  and 
(4.2);  it  characterizes  scaling  functions. 

Definition  6.  A  function  r){p)  :  IR"^  ->  m  is  strongly  admissible  if  s{0)  >  0  and  if 
s{q)  =  qri{l/q)  is  concave  and  satisfies  0  <  s'(q)  ^  1. 

One  immediately  sees  that  if  T]{p)  is  strongly  admissible,  it  is  concave. 

Theorem  3.  Let  r}(p)  be  a  strongly  admissible  function  and  V  be  the  function  space  de¬ 
fined  by  (5.9).  The  domain  of  definition  of  the  spectrum  of  singularities  of  quasi-all  functions 
ofV  is  the  interval  [^(O),  l/pj  yvhere 

(5.10)  d{H)  =  inf  {Hp  -  r}{p)  -f  1). 

P^Pc 

Remarks:  Formula  (5,10)  states  that  the  spectrum  of  quasi-all  functions  is  composed  of 
two  parts: 

•  A  part  defined  by  H  <  q'(pc)  where  the  infimum  in  (5.10)  is  attained  for  p  >  pc,  and 
the  spectrum  can  be  computed  as  the  ‘usual’  Legendre  transform  of  7]{p) 

d{H)  =  mf(Hp-v(p)  +  l]. 

•  A  part  defined  by  r?'(pc)  ^  ^  l/pc  where  the  infimum  in  (5.10)  is  attained  for 

p  =  Pc,  and  the  spectrum  is  a  straight  segment  d(H)  —  Hpc. 

This  second  case  shows  that  the  initial  formulation  of  Frisch  and  Parisi  (where  the  Legendre 
transform  is  taken  on  all  positive  p’s)  fails  in  this  part  of  the  spectrum.  Comparing  (5)  and 

(5.10)  we  see  that  quasi-all  functions  of  V  strive  to  have  their  Holder  singularities  on  a  set  as 
large  as  possible. 

The  study  of  the  properties  of  quasi-all  functions  with  a  given  a  priori  regularity  goes 
back  to  the  famous  paper  of  Banach  [5],  which  gives  differentiability  properties  of  quasi- 
all  continuous  functions.  Recently  Z,  Buczolich  and  J.  Nagy  proved  in  [9]  that  quasi-all 
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monotone  continuous  functions  on  [0, 1]  are  multifractal  with  spectrum  d{H)  ~  H  for  H  e 

[0.1]. 

5.3.  Almost-sure  results  from  histograms 

This  section  describes  results  from  [4].  We  start  by  describing  the  processes  we  will 
study.  We  suppose  that,  at  each  scale  j,  the  wavelet  coefficients  of  the  process  are  picked 
independently  from  a  given  histogram.  We  denote  by  pj  the  common  probability  measure 
of  the  2^  random  variables  Xj^k  =  —  (log2  \^\\)/j  (1^^  signs  of  the  wavelet  coefficients  have 
no  consequence  for  Holder  regularity;  therefore,  we  do  not  need  to  make  any  assumption  on 
them).  The  measure  pj  thus  satisfies 

P(log(K|)<2-“l)=p,((-,^,o]). 

We  need  to  make  two  assumptions  on  the  pj.  The  first  one  is 

3e  >  0  :  Supp  {pj)  C  [e,  +oo); 

This  assumption  means  that  the  sample  paths  belong  to 

Let  us  now  define  some  quantities  that  will  be  pertinent  in  our  study.  For  each  j,  let 
Nj{a)  =  #{k  :  \Cjk\  ^  2““^}  obtained  after  these  2^  draws  have  been  performed.  There¬ 
fore,  JE{Nj{a))  =  2^pj([0,  a]).  Note  that  the  word  empirical  will  be  used  in  the  following 
in  relation  to  random  quantities  that  are  measured  on  the  sample  paths  (as  opposed  to  deter¬ 
ministic  quantities  that  are  derived  from  the  pj).  We  note 

p(a)  =  limlimsup  log.  (2^,(1^^  -  a  +  e])) 

and  p{a)  is  the  corresponding  empirical  quantity  already  defined.  The  purpose  of  the  fol¬ 
lowing  hypothesis  is  to  make  sure  that  some  sizes  of  wavelet  coefficients  do  not  appear  with 
small  but  nonvanishing  probability.  If  it  holds,  quantities  deduced  from  histograms  and  sam¬ 
ple  paths  will  coincide. 

(Either  p{a)  —  ~oo  or  p{a)  ^  0.  If  p(a)  =  0,  there  exists 
a  subsequence  and  a  sequence  e„  ->  0  such  that 
~  +  ^n])  ^  2j2* 

Proposition  6.  If  Hypothesis  {%)  holds,  with  probability  one,  Va  >  0  p{a)  —  p(a). 
From  now  on,  we  suppose  that  p(q;)  >  0  for  at  least  one  a.  Let 


S.  Jaffard  /  Wavelet  expansions,  function  spaces  and  multifractal  analysis 


143 


Theorem  4.  Let  f  be  a  random  wavelet  series  satisfying  {H),  The  spectrum  of  almost 
every  sample  path  off  has  support  included  in  [e,  Hmax]^  where 

(5.11)  d{H)  =  H  sup 

ae[o,/r]  Oi 

Remarks:  By  inspecting  (5.11),  it  is  clear  that  d{H)  need  not  be  concave,  which  shows 
another  possible  occurrence  of  the  failure  of  the  standard  multifractal  formalism. 

Comparing  Proposition  5  and  Theorem  4,  we  see  that  the  spectrum  d{H)  of  a  random 
wavelet  series  takes  the  largest  possible  values  compatible  with  the  bounds  (5.4),  which  shows 
that  these  bounds  are  optimal. 
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Some  Plots  of  Bessel  Functions  of  Two 

Variables 


F.A.  Griinbaum 

Department  of  Mathematics 
University  of  California 
Berkeley,  CA  94720 

Abstract.  For  a  finite  reflection  group  G  there  is  a  rich  theory  developed  by  Dunkl, 
Heckman  and  Opdam  leading  to  the  notion  of  a  commuting  set  of  Bessel  differen¬ 
tial  operators.  These  systems  play  an  important  role  in  the  study  of  Calogero-Moser 
systems  and  other  problems  of  physical  interest.  When  G  acts  on  the  real  line  one 
recovers  the  usual  Bessel  function  with  a  well  known  power  series  expansion  at  the 
origin.  We  obtain  some  such  expansions  in  the  case  of  G  =  A2  acting  in  the  plane 
and  we  use  these  to  produce  plots  of  some  of  these  functions. 


1.  Introduction 

Special  functions  such  as  Gauss’  hypergeometric  series  and  the  Bessel  functions  have 
played  a  crucial  role  in  math-physics  for  a  very  long  time,  and  many  branches  of  engineer¬ 
ing  and  technology  are  good  examples  of  such  use.  The  appearance  of  symbolic/numer¬ 
ical/graphics  packages  such  as  Macsyma,  Maple,  Mathematica,  and  others  has  put  plots  of 
these  functions  at  the  fingertips  of  many  users  in  applied  fields. 

The  pathbreaking  work  of  Dunkl,  Heckman  and  Opdam,  de  Jeu  and  others  has  produced 
a  rich  theory  of  Bessel  functions  of  several  variables  which  extends  in  a  natural  way  the  one 
dimensional  situation.  There  are  also  several  versions  of  Gauss’  function  in  the  case  of  several 
variables,  and  a  large  program  dealing  with  polynomials  has  been  developed  by  Macdonald, 
Koomwinder  and  Cherednik,  as  well  as  others. 


The  author  was  supported  in  part  by  NSF  Grant  #  DMS94-00097  and  by  AFOSR  under  Contract 
FDF49620-96-1-0127. 
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These  functions  have  arisen  at  times  in  connection  with  integrable  systems  or  as  spherical 
functions  for  appropriate  symmetric  spaces.  One  can  wonder  if  these  relatively  recent  mathe¬ 
matical  objects  will  eventually  have  an  impact  on  technology  comparable  to  the  one  variable 
case,  and  how  much  they  will  penetrate  the  engineering  literature. 

There  are  at  least  three  interrelated  answers  to  the  question  above: 

a)  this  will  happen  only  if  interesting  “down-to-earth”  problems  ever  get  solved  in  terms 
of  these  new  functions. 

b)  the  amount  of  heavy  going  mathematical  sophistication  needed  to  go  through  this  new 
material  makes  it  unlikely  that  enough  “applied  math”  people  will  take  the  time  to 
learn  about  these  new  tools  and  test  their  applicability. 

c)  maybe  some  few  plots  will  not  hurt.  After  all,  the  best  way  to  see  the  usefulness  of 
sines  and  cosines — as  well  as  Bessel  functions,  elliptic  functions,  and  many  other  such 
gems — is  to  view  some  few  graphs  of  them.  Keep  in  mind  that  whether  we  like  it 
or  not,  more  and  more  engineering  students  will  be  trained  in  front  of  a  screen  with 
increasing  graphical  capabilities. 

The  goal  of  this  note  is  rather  modest,  I  want  to  show  some  of  these  plots  and  hopefully 
provoke  someone  into  doing  a  better  job.  Given  the  quality  of  the  graphs  that  I  show  this 
should  not  be  too  much  of  a  challenge.  I  have  not  seen  any  such  graphs,  or  even  some  of  the 
power  series  expressions  that  I  will  use  to  compute  with,  in  the  literature.  It  is  clear  that  for 
computational  purposes  eventually  one  would  like  something  smarter  than  just  power  series, 
like  piecewise  rational  approximation  or  similar  things.  This  can  wait  until  we  see  if  these 
new  functions  get  used  enough  so  as  to  warrant  such  an  effort. 


2.  The  eigenvalue  problem 

On  the  plane  with  coordinates  (a,  b)  consider  the  operators  opi  and  op2  given  by 


opiif)  = 

+  k 

OP2{f)  = 


II  ll 

db‘^  da^ 


((\/36  -t 


+ 


1 


W) 


(x/36-l-3a)2  (v^6-3a)2  12fe2 

((v/36-3a)2  “  (x/36+3a)'^~)  i 


2\/3 


d^}  o  d^S 


+ 


da^  dadb'^ 

((V36+3n'’  + 


)2  '  (\/36-3of 


-  — ^ 
662  j 


Here  k  is  an  arbitray  parameter. 
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These  operators  are  invariant  under  the  operations  1)  6  goes  into  -6  as  well  as  2)  (a,  b) 
goes  into  (-a/2  H-  \/3/26,  VS/2a  +  b/2).  The  first  one  is  a  reflection  across  the  a  axis,  the 
second  one  a  reflection  across  the  axis  given  by  the  vector  (— \/3, 1).  The  same  is  true  if  one 
considers  one  more  reflection,  across  the  axis  making  an  angle  of  —60  degrees  with  the  a 
axis.  There  is  then  a  three-fold  symmetry,  and  since  these  reflections  generate  a  group  of  six 
elements  (the  symmetric  group  on  three  symbols)  the  full  group  of  symmetries  has  order  six. 

The  operators  commute  and  we  define  the  Bessel  function  as  the  function  made  symmetric 
by  adding  six  terms  of  the  form 

/(a,  6,  si,  S2)  =  -f  small  at  infinity) 

that  solves  the  system 

opiif)  =  Ai/ 

and 

OP2{f)  =  A2/. 

This  clearly  requires  Ai  =  sf  +  52  and  A2  =  sf  -  3siS2- 

Put  a  =  r  cos  f  and  b  =  r  sin  f  and  put  x  =  cos  3t  and  make  the  analoguous  change  of 
variables  on  the  spectral  side  cos  s,  S2  =  P  sin  s,y  —  cos  3s. 

For  later  use,  introduce  p  as  any  root  of  the  equation 

~  3  ' 

Conjugate  the  resulting  operators  by  the  factor 

rP(i  _ 


i.e.  define  oppi  by 

r^(l  —  x^y^^  oppi  =  opi  r^(l  —  x^y^^ 
to  get  new  operators  oppi  (i  =  1, 2)  given  by 

~  ,  ^(2p  -|- 1)  (ff 

7.2  ^2  j.  ^y.2 


and 


3f  (35a:^  -  4p^  +  6p  -  17) 

270(x-lf(x  +  l)^ 

^3 

1350(i-l)x(x  +  l) 

y»3 


r2 

27d^(x-l)x(x  +  l) 
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3fx  (ff 
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respectively. 

Consider  now  the  resulting  equations 

OPPI  f  =  0^f 
opp2  f  =  P^yf. 

It  is  easy  to  see  that  they  admit  solutions  of  the  form 

mthcijki^  Cklij. 

In  particular  we  get  the  well  known  fact  that  the  Bessel  functions  are  symmetric  in  the  in¬ 
terchange  between  “spatial”  and  “spectral”  variables.  This  is  then  a  “trivial  or  basic”  instance 
of  the  bispectral  property  discussed  in  [DG]  but  for  systems  of  partial  differential  operators. 
In  this  connection  it  is  pleasing  to  see  that  the  relations 

a(f{oppi)(r‘^)  =  0  i  =  1,2 


and 

ad^{oppi){r^x)  =0  2  =  1,2 

hold  in  this  case. 

If  we  are  in  the  simpler  case  when  /3  =  ?/  =  1  we  have  solutions  of  the  form 

1  -I-  ^ 


to  the  equations 


opPi  f  =  f,  opp2  f  =  f. 


For  this  case  we  have  a  simple  expression  for  the  coefficients  Cij,  namely 


Ci,-  =  r(p)/r(p/3)3T(p/3  +  2  +  j)/(22i+2i2y  !r(p  +  22  +  3j)) 

and  this  will  be  used  in  the  plots  below.  These  plots,  made  for  different  values  of  p,  give  an 
indication  of  /  for  a:  =  cos  30  in  (-1, 1)  and  r  in  (0, 1). 


To  get  the  true 
gating  factor 


Bessel  functions  we  still  need  to  multiply  these  expressions  by  the  conju- 
r^il  - 


used  above. 

E.  Opdam  [02]  mentioned  to  me  that  M.  de  Jeu  had  made  some  relevant  computations.  I 
have  received  some  notes  from  de  Jeu  [dJ2]  who  has  obtained  other  expressions  which  could 
be  used  in  producing  plots  of  the  Bessel  functions. 

There  are  also  expressions  for  those  Bessel  functions  in  [O]  that  could  be  used  for  plotting 
purposes. 
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Figure  3.  p=-3/2 


Figure  4.  p=l/2 
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R.  Tolimieri  and  M.  An 

Abstract.  The  introduction  of  the  Cooley-Tukey  Fast  Fourier  transform  (C-T  FFT) 
algorithm  in  1968  was  a  critical  step  in  advancing  the  widespread  use  of  digital  com¬ 
puters  in  scientific  and  technological  applications.  Initial  efforts  focused  on  realizing 
the  potential  of  the  immense  reduction  in  arithmetic  complexity  afforded  by  the  FFT 
for  computing  the  finite  Fourier  transform  and  convolution.  On  existing  serial  butter¬ 
fly  architectures,  this  limited  implementations  of  the  FFT  to  transform  sizes  a  power 
of  two. 

The  original  C-T  FFT  algorithms  and  its  many  extensions  rested  mainly  on  addi¬ 
tive  structures  of  the  data  indexing  set  and  divide-and-conquer  strategies  for  reducing 
complexity.  The  relative  cost  of  multiplications  as  compared  with  additions  on  these 
early  machines  motivated  the  study  of  new  algorithms  for  reducing  multiplications 
usually  at  the  cost  of  increasing  additions.  These  new  algorithms  were  based  on  mul¬ 
tiplicative  structures  of  the  data  indexing  set.  They  resulted  in  greater  flexibility  as 
compared  to  earlier  efforts  by  allowing  for  fast  small  prime  size  computations  which 
could  be  embedded  in  a  wide  collection  of  large  transform  sizes. 

During  the  1980s,  the  increasing  importance  of  RISC  and  parallel  computers  re¬ 
moved  some  of  the  initial  motivation  for  these  multiplicative  FFTs.  Many  RISC  ar¬ 
chitectures  featured  a  hardwired  multiply  and  accumulate  which  permitted  multiplica¬ 
tions  to  be  nested  in  additions.  The  goal  was  to  arrange  the  computations  so  that  most 
multiplications  were  followed  by  an  addition.  Parallel  architectures  placed  the  major 
algorithmic  burden  on  controlling  the  data  flow.  The  exotic  data  flow  of  the  multi¬ 
plicative  FFTs  were  often  incompatible  or  at  least  required  an  immense  coding  effort 
for  efficient  parallel  computation.  Only  the  need  for  flexible  transform  sizes  justified 
continued  efforts. 

The  recent  importance  of  FPGAs  and  reconfigurable  hardware  renews  the  need 
to  reduce  multiplication  counts.  In  this  talk,  we  will  review  some  of  the  work  on 
multiplicative  approaches  introduced  mainly  at  IBM  in  the  1970s  and  1980s  and  place 
these  results  within  the  realm  of  harmonic  analysis. 


I.  Introduction 

The  fast  Fourier  transform  (FFT)  algorithm  has  a  long  history  dating  from  Gauss  but  has 
over  the  years  been  rediscovered  in  a  variety  of  contexts.  It  is  closely  related  to  the  Poisson 
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summation  formula  and  the  Chinese  remainder  theorem  for  polynomials.  In  the  latter  it 
reflects  the  semi-simplicity  of  the  complex  group  algebra  of  Z/N,  N  ^  2. 

In  1965  the  FFT  was  introduced  to  IBM  by  James  Cooley  [7]  and  was  the  joint  work  of 
James  Cooley  and  John  Tukey.  Although  other  FFT  algorithms  such  as  the  Good-Thomas 
prime  factor  algorithm  had  been  in  use,  especially  in  geophysics  [10, 16],  the  great  reduc¬ 
tion  in  computation  time  for  large  size  problems,  flexibility,  ease  of  coding,  and  widespread 
applicability  of  the  Cooley-Tukey  (CT)  FFT  algorithm  was  quickly  recognized  by  IBM  man¬ 
agement.  They  saw  it  as  the  key  tool  for  the  rapidly  expanding  use  of  digital  computers. 

The  success  of  the  CT  FFT  algorithm  often  hid  several  of  its  disadvantages  in  scientific 
and  engineering  applications.  In  fact,  from  the  very  beginning,  especially  at  IBM,  other 
algorithms  for  computing  the  finite  Fourier  transform  had  been  discovered  and  sporadically 
found  use.  Typically  these  algorithms  increase  the  number  of  additions  and  decrease  the 
number  of  multiplications  as  compared  with  the  CT  FFT. 

During  the  1980s,  with  the  introduction  of  reduced  instruction  set  computer  (RISC)  pro¬ 
cessors  which  are  capable  of  nesting  multiplications  inside  additions  (pipelined  dual  ops), 
these  approaches  were  ignored. 

We  will  discuss  two  historically  neglected  approaches  which,  driven  by  recent  technolog¬ 
ical  advances,  have  become  increasingly  important,  the  Rader-Winograd  multiplicative  FFT 
algorithms  and  the  reduced-polynomial  transform  algorithms. 

Much  of  this  work  was  carried  out  during  the  late  1970s  and  early  1980s  by  S.  Wino- 
grad  [19-22]  and  H.  J.  Nussbaumer  [12, 13]  and  placed  in  a  mathematical  framework  by 
several  of  their  collaborators  including  L.  Auslander,  E.  Feig  [3]  and  at  a  later  time  by  C.  S. 
Burrus  [5,6],  I.  Gertner  [9],  and  M.  Rofheart  [15].  Additional  results  and  references  can  be 
found  in  texts  [4,8, 11, 17, 18]. 

Proofs  of  theorems  stated  in  this  presentation  are  in  [18], 

2.  Multiplicative  Theory 
2.1.  Introduction 

Standard  divide-and-conquer  FFT  algorithms  are  based  on  the  existence  of  nontrivial  sub¬ 
groups  of  the  natural  additive  group  structure  of  the  underlying  indexing  set.  The  specification 
of  a  subgroup  decomposes  the  indexing  set  into  cosets  and  the  computation  is  decomposed 
relative  to  the  coset  partitioning.  The  importance  of  two-to-a-power  size  indexings  in  early 
programming  efforts  is  due  to  the  plentitude  of  subgroups  and  the  simplicity  of  implementing 
the  two-point  Fourier  transform  (butterfly).  Data  readdressing  is  especially  regular  (striding) 
and  was  eventually  hardwired  into  many  architectures  (LOAD-STORE). 

The  immense  speed-up  in  run  time  of  two-to-a-power  FFT  codes  as  compared  with  direct 
computation  resulted  in  significant  compromises  in  many  applications.  The  need  to  zero- 
pad  from  a  natural  computation  size  to  the  nearest  two-to-a-power  size  introduced  increased 
memory  requirements  and  less  accuracy,  especially  in  multidimensional  problems.  In  some 
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applications,  such  as  X-ray  crystallography,  symmetry  relations  on  data  which  had  played  an 
essential  role  before  the  FFT  became  difficult  to  fully  exploit  inside  standard  FFT  codes. 

Moreover,  the  regularity  of  these  codes  caused  memory  call  conflicts  on  many  large  vector 
and  parallel  processors.  The  need  for  fast  Fourier  transform  codes  acting  on  more  flexible 
data  set  sizes  became,  by  the  end  of  the  1970s,  an  increasingly  important  goal.  The  most 
important  step  in  achieving  this  goal  was  quickly  seen  to  be  the  designing  of  algorithms  for 
small  prime  size  Fourier  transforms  which  could  then  be  nested  in  standard  FFT  codes. 

C.  Rader  [14]  and  independently  S.  Winograd  had  developed  such  algorithms  in  the  early 
1970s,  but  for  the  most  part  these  algorithms  were  viewed  as  mathematical  oddities.  These 
algorithms  suffered  from  several  disadvantages  which  at  the  time  precluded  their  widespread 
adoption.  Coding  is  difficult,  with  each  size  requiring  a  machine  dependent  special,  time- 
consuming  coding  effort  to  implement  complex  data  readdressing.  On  reduced  instruction  set 
computer  (RISC)  architectures  the  instruction  set  for  these  codes  can  use  significant  memory. 

From  the  late  1980s  codes  for  small  prime  size  Fourier  transforms  have  increasingly  be¬ 
come  part  of  standard  programming  packages.  In  this  section  we  will  first  describe  the  orig¬ 
inal  derivations  of  C.  Rader  and  S.  Winograd  and  then  place  them  in  a  harmonic  analysis 
framework. 

2.2.  Rader  algorithm 

The  additive  group  Zfp,  p  a.  prime,  has  no  nontrivial  subgroups,  but  the  multiplicative 
group  U(p)  of  nonzero  elements  is  cyclic.  The  main  idea  underlying  the  p-point  FFT  as 
described  by  C.  Rader  is  that  the  p-point  Fourier  transform  relative  to  input  and  output  data, 
ordered  by  powers  of  any  generator  of  U{p),  has  an  especially  simple  form:  the  p-point 
Fourier  transform  becomes  a  (p  ~  l)-point  cyclic  convolution. 

For  odd  prime  p,  the  group  of  units  U{p^)  of  Z/p^  is  again  a  cyclic  group.  Using  this 
result  S.  Winograd  independently  extended  Rader’s  result  to  odd  prime  power  sizes.  Fourier 
transform  algorithms  over  finite  fields  can  be  developed  in  a  similar  manner.  These  last  FFT 
algorithms  are  important  in  number  theoretic  transforms  and  in  error-correcting  coding  [1,2]. 

Suppose  p  is  an  odd  prime  and  t  =  p  —  1.  The  unit  group  U(p)  of  Z/p  is  a  cyclic  group 
of  order  t.  Choosing  a  generator  g  of  U  (p),  we  have 

u{p)  = 

Denote  by  itg  the  permutation  of  Z/p  defined  by 

’Tj  =  (0  1  5  •  •  •  9*"*)  . 

and  by  Pg  the  p  x  p  permutation  matrix  defined  by 

Pg  —  [Cq  Gi  Op  •  •  •  Ggt-i] 

where  e„  is  the  p-tuple  having  1  in  the  n-th  position  and  0  otherwise,  0  ^  n  <  p.  The  main 
result  of  C.  Rader  is  contained  in  the  following  theorem. 


154 


R.  Tolimieri  andM.  An  /  Lesser  Known  F FT  Algorithms 


Theorem  1. 


where  C(p)  is  the  {p~  1)  x  (p  -  1)  skew-circulant  matrix 

V  • 

v9  y9^ 


Example  1.  Forp  =  5  and  g  =  2, 

■  1  0  0  0  0  ' 

0  10  0  0 
P2-  0  0  10  0 

0  0  0  0  1 

0  0  0  1  0 


C{p)  =  = 
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Theorem  2. 

■  1  t 

0 

1  -1 

(1  ®  F(«)->)  P-'F(p)P,  (1  ®  F{t))  = 

0 

Pi 

Consider  the  functions  Xn,  0  <  n  <  on  Z/p  defined  by  the  following  table. 


Table  1.  Multiplicative  characters 


Since  Hn,0  <t,  can  be  written  as 


Hn=  Iv  ...  ^ 


we  have  the  following  result. 

Theorem  3. 

Pn  =  P(p)(Xn)(l),  0^n<t. 

2.3.  Multiplicative  characters 

Table  1  describes  the  multiplicative  characters  of  Ti/p.  Such  characters  were  introduced 
by  Gauss  and  studied  in  his  work  on  reciprocity  laws.  The  complex  constants  0  <  n  < 
are  called  the  Gauss  sums  of  multiplicative  characters. 
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The  general  definition  is  given  as  follows.  A  mapping  x  '  U{p)  Ch  called  a  multi¬ 
plicative  character  mod  p  if  x  is  a  homomorphism  from  the  multiplicative  group  U (p)  into 
the  multiplicative  group  of  nonzero  complex  numbers, 

X{ab)  =  x{a)x{b),  a,beU{p). 

A  multiplicative  character  mod  p  is  completely  determined  by  its  value  on  the  generator  g  of 
U{p)  by 

=  xigr,  0  <  m  <  t 

Since  p*  =  1,  x(p)  is  a  t-ih  root  of  unity.  For  0  ^  n  <  t,  denote  by  Xn  the  multiplicative 
character  defined  by 

(2.1)  Xn{g)=w^ 

and  observe  that  the  multiplicative  characters  Xn,  0  ^  n  <  t,  defined  by  (  2.1)  coincide  with 
the  functions  in  Table  1. 

Denote  by  L'^{Z/p),  the  inner  product  space  of  all  complex  valued  functions  on  Z/p  with 
inner  product 

p-i 

n=0 

where  *  denotes  complex  conjugation. 

Setr 

Theorem  4. 

Xr(ff)  =  -1 

and 

Xn  =  Xt-n,  Q^n<t. 

Theorem  5.  For  any  two  multiplicative  characters  x  ond  x'-  have 

Extend  the  domain  of  definition  of  each  multiplicative  characters  x  to  Z/p  by  setting 
X(0)  =  0.  Setpn  =  t~^Xn,  0Kn<t. 

Theorem  6.  The  set 

Mo?  •  •  •  5 

is  an  orthonormal  basis  of  {Z/p). 

Since  F{n)‘^f{x)  =  nf{~x)  we  have 

F(p/x{x)  =PX{-x)  -px{-i)x{x), 
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implying  that  the  multiplicative  characters  are  eigenvectors  of  F(p)^, 

F(p/x  =  px(-l)x 

and  that  the  space  spanned  by 

{x,  F{p)x} 

is  invariant  under  the  action  of  F{p),  The  action  of  F{p)  on  the  multiplicative  characters  is 
described  in  the  next  two  theorems. 

Theorem  7. 

F(p)eo  =  fio  +  Xo- 
F{p)xo  =  teo-Xo- 
For  any  nontrivial  multiplicative  character  x,  we  have 
Theorem  8. 

F(p)x  =  V^Gp(x)x*. 

where 

F{p)x{l)  =  V^pix)- 

Proof 

F{p)xix)  =  XI 

vec/(p) 

yec/(p) 

=  X(a:"^)  X 

y^u(p) 

=  X(a:’^)^’(p)x(l)- 

By  ||F(p)/|P  =p|l/||^wehave 

|Gp(x)l  =  1- 

Relative  to  the  orthonormal  basis  eo,  of  we  have 

F(p)eo  ==  eo  +  i2^o, 

F(p)pq  =  f^eo-A^o 
and 

F{p)pn  =  ^JpGp{x)ut-n,  l^n<t. 
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The  matrix  of  F{p)  relative  to  this  basis  is  given  by 


■  1 

0 


v/pGp(x(-i)  • 


0 


L  VPGp(x,)  J 

This  matrix  representation  of  F{p)  is  essentially  the  same  as  that  in  theorem  2.  It  is  the  first 
step  in  several  studies  into  the  p-point  Fourier  transform  [3].  Other  studies  include: 

•  construction  of  an  orthonormal  basis  diagonalizing  F{p), 

•  determining  rational  subspaces  of  F(p). 

3.  Multidimensional  Algorithm 

3.1.  Introduction 

The  reduced  transform  algorithms  (RTA)  [9, 15, 17]  compute  a  multidimensional  Fourier 
transform  by  first  projecting  the  multidimensional  input  data  onto  lines  (more  generally,  lower 
dimensional  planes)  and  then  computing  the  one-dimensional  Fourier  transform  of  the  pro¬ 
jected  data.  By  the  periodic  version  of  the  projection  slice  theorem  from  tomography,  these 
one-dimensional  Fourier  transforms  compute  the  multidimensional  Fourier  transform.  The 
RTA  can  be  viewed  as  periodic  Radon  transforms. 

Polynomial  rings  play  many  roles  in  algorithm  design.  Linear  convolution  can  be  defined 
as  polynomial  product  and  cyclic  convolution  mod  N  can  be  defined  as  polynomial  prod¬ 
uct  mod  —  1.  Fourier  transform  can  be  defined  over  certain  quotient  polynomial  rings. 
The  Chinese  remainder  theorem  (CRT)  for  polynomial  rings  is  especially  important  in  the 
Cook-Toom  and  Winograd  convolution  algorithms  [18,22].  The  convolution  theorem  which 
diagonalizes  cyclic  convolution  mod  N  relative  to  the  iV-point  Fourier  transform  basis  can 
be  viewed  as  the  CRT  applied  to  the  quotient  polynomial  ring  C[x]/x^  -  1, 

C[x]/x^  —  \=C[x]/x  —  1  ©  C[x]lx  ~  i;  ©  •  •  •  ©  C[x\lx  — 

where  v  = 

Polynomial  ring  theory  is  central  to  many  of  the  results  in  the  works  of  H.  J.  Nuss- 
baumer  [12, 13],  both  for  multidimensional  convolution  and  Fourier  transform  (polynomial 
transform).  One  goal  in  these  works  is  to  rely  as  much  as  possible  on  shifts  and  cyclic  shifts 
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to  carry  out  these  computations.  Polynomial  transforms  compute  multidimensional  convolu¬ 
tions  by  first  using  the  CRT  to  decompose  the  computation  into  subcomputations,  some  of 
which  can  be  computed  by  Fourier  transform  computations  over  rings  and  then  completed  by 
using  the  inverse  CRT  isomorphism  to  combine  the  subcomputations. 

The  RTA  can  be  adapted  to  computing  multidimensional  convolution  and  provides  a  geo¬ 
metric  setting  for  the  Nussbaumer  polynomial  transform.  A  multidimensional  convolution 
can  be  computed  by  RTA  which  first  projects  the  multidimensional  data  onto  lines,  computes 
the  convolution  of  the  one-dimensional  data  and  then  combines  the  subcomputations.  It  dif¬ 
fers  from  the  RTA  for  multidimensional  Fourier  transform  in  that  the  last  step  does  not  rely 
on  the  projection  slice  theorem.  We  will  address  this  last  problem  by  defining  an  inverse  pro¬ 
jection  algorithm  which  reconstructs  multidimensional  data  from  its  one-dimensional  order 
projections. 

In  some  ways  this  is  a  curious  result  since,  in  the  continuous  case,  the  inversion  requires 
either  Fourier  transform  computations  or  convolutions  and  back  projections  and  can  be  ex¬ 
tremely  costly.  However  a  close  inspection  of  these  stages  in  the  Nussbaumer  polynomial 
transform  algorithm  shows  that  this  step  can  be  accomplished  using  only  additions.  We  will 
show,  using  the  constructions  underlying  the  RTA,  how  this  last  step  can  be  implemented 
using  integer  matrix  multiplication. 


3.2.  Periodic  back  projection 


Consider  Z/3  x  Z/3.  By  a  line  in  Z/3  x  Z/3,  we  mean  a  maximal  cyclic  subgroup  of 
Z/3  X  Z/3.  There  are  4  distinct  lines  in  Z/3  x  Z/3, 

=  {(0,0),  (2,j)},  0<j<3. 

L(0,1)  =  {(0,0),  (0,1),  (0,2)}. 

Suppose  /  £  L{7ilZ  X  Z/3).  The  periodizations  of  /  orthogonal  to  the  4  lines  can  be 
constructed  as  follows.  Denote  by  f  the  lexicographically  ordered  two  dimensional  array  / 


r/(o,o)  ] 
/(l.O) 
/(2,0) 
/(0,1) 
/(l.l) 
/(2,1) 
/(0,2) 
/(1, 2) 
L/(2,2)  J 


The  periodization  orthogonal  to  L(0, 1)  consists  of  the  sums 


/(0,0)  +  /(1,0)  +  /(2,0), 
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/(0,1)  +  /(1,1)  +  /(2,1), 

/(0, 2)  + /(1, 2)  4-/(2, 2), 

which  we  can  write  as 

The  periodization  orthogonal  to  L(l,  0)  consists  of  the  sums 

/(0,0)  +  /(0,l)  +  /(0,2), 

/(1,0)  +  /(1,1)  +  /(1,2), 

/(2,0)  +  /(2,l)  +  /(2,2), 

which  we  can  write  as 

(/3  0l^)P(9,3)f-(U0/3)f. 

Set 

'001' 

^3=  10  0 
0  10 

and  =  Is®  S3®  S^.  The  periodization  orthogonal  to  L(l,  1)  is  given  by 

(/3  ®  3)Z3f  =  (U  ® /3)^3f 

and  the  periodization  orthogonal  to  L(l,  2)  is  given  by 

(h  ®  3)Z|f  =  (1^  ®  h)Zlf. 

We  assume  that  we  know 

Is  0  I3 

ll®h 

(I3  0  Is)Zs 
(1*0Z| 

and  we  want  to  compute  f .  We  will  show 

A^Af  =  (9/3  +  Es<^  Es)i, 

where  Es  is  the  3  x  3  matrix  of  all  ones. 

Since  (P3  0  P3)f  is  known  from  the  sum 

j=o  k=o 

We  can  compute  f  from  A'^Af. 

The  following  table  lists  the  matrices  in  A,  their  transposes  and  the  products  of  the  matri¬ 
ces  with  their  transposes.  >1^44  is  the  sum  of  the  matrices  in  the  last  column. 


f Af 
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Since  I  +  =  E3,  A  is  as  claimed. 
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Abstract.  The  intensities  of  a  sufficient  number  of  X-ray  diffraction  maxima  de¬ 
termine  the  structure  of  a  crystal,  that  is,  the  positions  of  the  atoms  in  the  unit  cell  of 
the  crystal.  The  available  intensities  usually  exceed  the  number  of  parameters  needed 
to  describe  the  structure.  From  these  intensities  a  set  of  numbers  |jEh|  can  be  derived, 
one  corresponding  to  each  intensity.  However,  the  elucidation  of  the  crystal  struc¬ 
ture  also  requires  a  knowledge  of  the  complex  numbers  Eu  =  \Eu\^^p{i(PH),  the 
normalized  structure  factors,  of  which  only  the  magnitudes  IEhI  can  be  determined 
from  experiment.  Thus,  a  “phase”  <^h.  unobtainable  from  the  diffraction  experiment, 
must  be  assigned  to  each  |£^h  |,  and  the  problem  of  determining  the  phases  when  only 
the  magnitudes  j£?H|  are  known  is  called  the  “phase  problem”.  Owing  to  the  known 
atomicity  of  crystal  structures  and  the  redundancy  of  observed  magnitudes  |£'h|>  the 
phase  problem  is  solvable  in  principle. 

Probabilistic  methods  have  traditionally  played  a  key  role  in  the  solution  of  this 
problem.  They  have  led,  in  particular,  to  the  so-called  tangent  formula  which,  in  turn, 
has  played  the  central  role  in  the  development  of  methods  for  the  solution  of  the  phase 
problem. 

Finally,  the  phase  problem  may  be  formulated  as  one  in  constrained  global  op¬ 
timization.  A  method  for  avoiding  the  countless  local  minima  in  order  to  arrive  at 
the  constrained  global  minimum  leads  to  the  Shake-and-Bake  algorithm,  a  completely 
automatic  solution  of  the  phase  problem  for  structures  containing  as  many  as  1000 
atoms  when  data  are  available  to  atomic  resolution. 

In  the  case  that  single  wavelength  anomalous  scattering  (SAS)  data  are  available, 
the  probabilistic  machinery  leads  to  estimates  of  special  linear  combinations  of  the 
phases,  the  so-called  structure  invariants.  A  method  of  going  from  estimates  of  the 
structure  invariants  to  the  values  of  the  individual  phases  is  described. 
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1.  Introduction 

When  a  crystal  is  irradiated  with  a  beam  of  X-rays  the  resulting  interference  effect  gives 
rise  to  the  so-called  diffraction  pattern  which  is  uniquely  determined  by  the  crystal  structure. 
Only  the  intensities  of  the  scattered  rays  can  be  measured;  the  phases,  which  are  also  needed 
in  order  to  work  backwards,  from  diffraction  pattern  to  the  atomic  positions,  are  lost  in  the 
diffraction  experiment.  However,  owing  to  the  known  atomicity  of  real  structures  and  the 
large  number  of  observable  intensities,  the  lost  phase  information  is  in  fact  contained  in  the 
measured  intensities.  The  problem  of  recovering  the  missing  phases,  when  only  the  intensi¬ 
ties  are  available,  is  known  as  the  phase  problem.  Alternatively,  since  the  magnitudes  of 
the  normalized  structure  factors  E  =  \E\  exp(i(p)  are  readily  determined  from  the  measured 
diffraction  intensities,  the  phase  problem  may  be  defined  as  the  problem  of  determining  the 
phases  ip  when  the  magnitudes  |E|  are  given. 

Due  to  the  redundancy  of  known  magnitudes  \El  the  phase  problem  is  an  over  deter¬ 
mined  one  and  is  therefore  solvable  in  principle.  This  over  determination  implies  the  exis¬ 
tence  of  relationships  among  the  Es  and,  therefore,  since  the  magnitudes  |E|  are  presumed 
to  be  known,  the  existence  of  identities  among  the  phases  (p  alone,  dependent  on  the  known 
magnitudes  \E\,  which  must  of  necessity  be  satisfied.  The  so-called  direct  methods  are  those 
which  exploit  these  relationships  in  order  to  go  directly  from  known  magnitudes  \E\  io  de¬ 
sired  phases  ip.  They  do  not  depend  on  the  presence  of  heavy  atoms  or  atoms  having  other 
special  scattering  properties,  for  example  anomalous  scatterers,  or  prior  structural  knowledge. 

The  techniques  of  modem  probability  theory  lead  to  the  joint  probability  distributions  of 
arbitrary  collections  of  normalized  structure  factors  from  which  the  conditional  probability 
distributions  of  selected  sets  of  phases,  given  the  values  of  suitably  chosen  magnitudes  \E\, 
may  be  inferred.  These  distributions,  dependent  on  known  magnitudes  \E\,  constitute  the 
foundation  on  which  direct  methods  are  based.  They  have  provided  the  unifying  thread  from 
the  beginning,  circa  1950,  until  the  present  time.  In  particular,  they  have  led  to  the  recent 
formulation  of  the  phase  problem  as  one  of  constrained  global  optimization  [1]. 

In  the  case  that  the  stmcture  consists  of  N  identical  atoms  in  the  unit  cell  the  relationship 
between  diffraction  pattern  and  crystal  stmcture  is  given  by  the  pair  of  equations 

N 

(1.1)  Eh  =  |£h|  exp(ivJH)  =  y^exp(27riH  ■  r,) 

j  =  l 

{A/'-V2  if  7^  —  r. 

0  ifr^r] 


(1.2) 


H.A.  Hauptman  /  The  Phase  Problem  of  X-ray  Crystallography 


165 


where  rj  is  the  position  vector  of  the  atom  labeled  j  and  |£^h|  is  obtained  from  the  intensity 
of  the  scattered  beam  in  the  direction  labeled  by  the  reciprocal  lattice  vector  En-  Clearly  if 
one  is  to  determine  the  crystal  structure  from  Eq.(1.2)  it  is  necessary  to  know  not  only  the 
magnitudes  IE'hI,  obtainable  from  the  diffraction  experiment,  but  also  the  phases  ipH,  lost  in 
the  diffraction  experiment. 

2.  Method 

2.1.  The  Structure  Invariants 

Equation  (1.2)  implies  that  the  normalized  structure  factors  Eu  determine  the  crystal 
structure.  However,  Eq.(l.l)  does  not  imply  that,  conversely,  the  crystal  structure  determines 
the  values  of  the  normalized  structure  factors  En  since  the  position  vectors  rj  depend  not 
only  on  the  structure  but  on  the  choice  of  origin  as  well.  It  turns  out,  nevertheless,  that  the 
magnitudes  |Eh|  of  the  normalized  structure  factors  are  in  fact  uniquely  determined  by  the 
crystal  structure  and  are  independent  of  the  choice  of  origin,  but  that  the  values  of  the  phases 
ipn  depend  also  on  the  choice  of  origin.  Although  the  values  of  the  individual  phases  depend 
on  the  structure  and  the  choice  of  origin,  there  exist  certain  linear  combinations  of  the  phases, 
the  so-called  structure  invariants,  whose  values  are  determined  by  the  structure  alone  and  are 
independent  of  the  choice  of  origin.  The  most  important  class  of  structure  invariants,  and  the 
only  one  to  be  considered  here,  consists  of  the  three-phase  structure  invariants  (triplets), 

(2.1)  (pHK  =  TH-\r<pK  +  <^-H-K5 

where  H  and  K  are  arbitrary  reciprocal  lattice  vectors. 

2.2.  The  Probabilistic  Background 

It  is  assumed  that  the  atomic  position  vectors  Vj  are  the  primitive  random  variables,  uni¬ 
formly  and  independently  distributed  in  the  unit  cell.  Then  the  normalized  structure  factors 
Eh,  as  functions  of  the  r^’s,  are  themselves  random  variables.  The  structure  invariants  (puK 
in  turn,  as  functions  of  the  individual  phases  ip  (Eq.(2.1)),  are  therefore  also  random  variables. 

2.3.  The  Conditional  Probability  Distribution  of(puK>  Given  |Eh  |, \Ek  |,  1Eh+k  | 

Under  the  conditions  set  forth  in  §  2.2  the  conditional  probability  distribution  of  the  triplet 
(pnK  (Eq-(2.1)),  given  the  presumed  known  values  of  |Eh|,|Ek|,|Eh+k|,  is  known  to  be 

(2.2)  P($|Ahk)  =  [27r/o(ylHK)]“^  exp(.4HK  cos  $) 
where  $  represents  the  triplets  (^hk,  the  parameter  Auk  is  defined  by 

(2.3)  Auk  =  2N-^/^\EtiEjiEn+K\, 

and  Id  is  the  Modified  Bessel  Function.  Equation  (2.3)  implies  that  the  mode  of  ^hk  is  zero, 
and  the  conditional  expected  value  (or  average)  of  cos  (^hk,  given  Auk*  is 

(2.4)  £(cos(^hk|^hk)  =  /i(-4hk)/^o(^hk)  >  0, 
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where  Ii  is  the  Modified  Bessel  Function.  It  is  also  readily  confirmed  that  the  larger  the  value 
of  Auk  the  smaller  is  the  conditional  variance  of  cos(^hk,  given  Auk-  It  is  to  be  stressed 
that  the  conditional  expected  value  of  the  cosine,  Eq.(2.4),  is  always  positive  since  Auk  >  0. 


2.4.  The  Minimal  Principle 


In  view  of  Eq.(2.4)  one  obtains  the  following  estimate  of  cos  puK' 
(2.5)  cos<^hk  ~  h(AuK) / h{AuK) 


and  expects  that  the  smaller  the  variance,  that  is  the  larger  ^hk,  the  more  reliable  this  es¬ 
timate  will  be.  Hence  one  is  led  to  construct  the  function  (the  so-called  minimal  function), 
determined  by  the  known  magnitudes  |E|, 


(2.6) 


R  =  R{p) 


1 

Eh,K  ^HK 


^v4hk(cos(/?hk  - 

H,K 


h{AuK)^ 


which,  in  view  of  Eq.(2.1),  is  seen  to  be  a  function  of  phases  (p  alone.  Equation  (2.4)  then 
implies  that  the  global  minimum  of  the  minimal  function  R(p),  where  the  phases  are  con¬ 
strained  to  satisfy  the  identities  known  to  exist  (§  1),  yields  the  desired  phases  (the  minimal 
principle).  Thus  the  phase  problem  is  formulated  as  a  problem  in  constrained  global  mini¬ 
mization  [2].  There  remains  only  the  problem  of  avoiding  the  myriad  local  minima  of  R{ip)  in 
order  to  arrive  at  the  constrained  global  minimum.  The  next  section  shows  how  this  minimum 
is  reached  via  the  computer  program  Shake-and-Bake, 


2.5.  The  Computer  Program  Shake -and-Bake" 

[3] 

The  six-part  Shake-and-Bake  phase  determination  procedure,  shown  by  the  flow  diagram 
in  Figure  1,  combines  minimal-function  phase  refinement  and  real-space  filtering.  It  is  an 
iterative  process  that  is  repeated  until  a  solution  is  achieved  or  a  designated  number  of  cy¬ 
cles  have  been  performed.  With  reference  to  Figure  1,  the  major  steps  of  the  algorithm  are 
described  next. 


2.5.1.  Generate  invariants.  Normalized  structure-factor  magnitudes  (jE'I’s)  are  generated  by 
standard  scaling  methods  and  the  triplet  invariants  that  involve  the  largest  corresponding  \E\ ’s 
are  generated.  Parameter  choices  that  must  be  made  at  this  stage  include  the  numbers  of 
phases  and  triplets  to  be  used.  The  total  number  of  invariants  is  ordinarily  chosen  to  be  at 
least  100  times  the  number  of  atoms  whose  positions  are  to  be  determined. 

2.5.2.  Generate  trial  structure.  A  trial  structure  or  model  is  generated  that  is  comprised  of 
a  number  of  randomly  positioned  atoms  equal  to  the  number  of  atoms  in  the  unit  cell.  The 
starting  coordinate  sets  are  subject  to  the  restrictions  that  no  two  atoms  are  closer  than  a 
specified  distance  (normally  1.2 A)  and  that  no  atom  is  within  bonding  distance  of  more  than 
four  atoms. 
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Figure  1 .  Flow  chart  for  Shake-and-Bake,  the  minimal-function  phase  refinement 
and  real-space  filtering  procedure. 


2.5.3.  Structure-factor  calculation.  A  normalized  structure-factor  calculation  (see  Eq.(l.l)) 
based  on  the  trial  coordinates  is  used  to  compute  initial  values  for  all  the  desired  phases 
simultaneously.  In  subsequent  cycles,  peaks  selected  from  the  most  recent  Fourier  series  are 
used  as  atoms  to  generate  new  phase  values. 

2.5.4.  Phase  refinement.  The  values  of  the  phases  are  perturbed  by  a  parameter-shift  method 
in  which  R{<p)^  which  measures  the  mean-square  difference  between  estimated  and  calculated 
structure  invariants,  is  reduced  in  value.  R{(f)  is  initially  computed  on  the  basis  of  the  set  of 
phase  values  obtained  from  the  structure-factor  calculation  in  step  C.  The  phase  set  is  ordered 
in  decreasing  magnitude  of  the  associated  \E\'s.  The  value  of  the  first  phase  is  incremented  by 
a  preset  amount  and  R{p)  is  recalculated.  If  the  new  calculated  value  of  R(p)  is  lower  than 
the  previous  one,  the  value  of  the  first  phase  is  incremented  again  by  the  preset  amount.  This 
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is  continued  until  R{^)  no  longer  decreases  or  until  a  predetermined  number  of  increments 
has  been  applied  to  the  first  phase.  A  completely  analogous  course  is  taken  if,  on  the  initial 
incrementation,  R{(p)  increases,  except  that  the  value  of  the  first  phase  is  decremented  until 
R{(p)  no  longer  decreases  or  until  the  predetermined  number  of  decrements  has  been  applied. 
The  remaining  phase  values  are  varied  in  sequence  as  just  described.  Note  that,  when  the 
ith  phase  value  is  varied,  the  new  values  determined  for  the  previous  i  -  1  phases  are  used 
immediately  in  the  calculation  of  R{(p).  The  step  size  and  number  of  steps  are  variables 
whose  values  must  be  chosen. 

2.5.5.  Fourier  summation.  Fourier  summation  is  used  to  transform  phase  information  into 
an  electron-density  map  (Refer  to  Eq.(1.2)). 

2.5.6.  Real-space  filtering  (Identities  among  phases  imposed).  Image  enhancement  is  ac¬ 
complished  by  a  discrete  electron-density  modification  consisting  of  the  selection  of  a  spec¬ 
ified  number  of  the  largest  peaks  on  the  Fourier  map  for  use  in  the  next  structure-factor  cal¬ 
culation.  The  simple  choice,  in  each  cycle,  of  a  number  of  the  largest  peaks  corresponding  to 
the  number  of  expected  atoms  has  given  satisfactory  results.  No  minimum-interpe^-distance 
criterion  is  applied  at  this  stage. 

Steps  C,  D,  E,  and  F  are  repeated  until  a  pre-assigned  number  of  cycles  has  been  com¬ 
pleted  or  until  the  process  converges.  The  smallest  of  the  final  values  of  the  minimal  function 
(one  for  each  trial)  reveals  the  constrained  global  minimum  of  the  minimal  function  R{(p) 
and  the  true  values  of  the  phases. 

3.  Single  Wavelength  Anomalous  Scattering  (SAS)  Data  are  Available 
3.1.  Introduction 

In  this  case  the  normalized  structure  factors  En  (compare  Eq.(l.l))  are  defined  by 

1  ^ 

(3.1)  En  =  —Y]  fj  exp(27rzH  •  rj) 


(3.2)  = 

j  =  l 

where  N  is  the  number  of  atoms  in  the  unit  cell,  Vj  is  the  position  vector  of  the  atom  labeled 
j  and  fj  is  the  (complex- valued)  atomic  scattering  factor,  presumed  to  be  known. 

3.2.  The  Probabilistic  Background 

With  the  assumption  that  SAS  diffraction  data  are  available,  the  conditional  probability 
distribution  P($)  of  the  triplet 

(3.3) 


THK  = 
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given  the  six  magnitudes 

(3.4)  |£^h|j  I^-hU^kU-E'-kI,  |.£^h+kU£^-h-k| 
is  known  to  be  [4] 

(3.5)  P($)  =  [27rIo(AuK)]~^  exp{AHK  cos($  -  a;HK)} 

in  which  Iq  is  the  Modified  Bessel  Function  and  ^hk  and  whk  are  expressed  in  terms  of  the 
six  magnitudes  (3.4)  and  the  (presumed  known)  complex- valued  atomic  scattering  factors  /. 
Compare  Eq.(3.5)  with  Eq.(2.2)  and  note  that  the  A^k  of  Eq.(3.5)  is  no  longer  defined  by 
Eq.(2.3)  but  is  instead  a  much  more  complicated  function  of  the  six  magnitudes  (3.4)  and  the 
atomic  scattering  factors  /.  Hence,  i4HK(>  0)  and  cjhk  are  here  assumed  to  be  known  for 
every  pair  (H,  K).  Note  that,  owing  to  the  anomalous  scattering,  the  six  magnitudes  (3.4) 
are,  in  general,  distinct  in  contrast  to  the  normal  case  when  |P-h|  =  |Ph|>  etc. 

In  view  of  (3.5),  the  most  probable  value  of  (pnK  is  cjrk,  and  the  larger  the  value  of  Auk 
the  better  is  this  estimate  of  (puK' 

(3.6)  cpuK  =  PuAPK-f  P-H-K  ~ 

3.3.  The  System  of  SAS  Tangent  Equations 

Fix  the  reciprocal  lattice  vector  H.  From  Eq.(3.6) 

(3.7)  pu  ~  -  Pk-  <P-h-k 


(3.8)  sin<^H  ^  sm(a;HK  ”  <^k  —  “^-h-k) 

which  has  approximate  validity  for  each  fixed  value  of  K.  Averaging  the  right-hand  side  of 

(3.8)  over  K,  naturally  using  weights  Ahk»  one  obtains 

(3.9)  sin  (pu  ^  ^  Auk  sm(a;HK  -  Pk~  P-u-k) 


Similarly 

(3.10) 


cos  pu 


YIk  ^HK 


J^K  ^HK 


K 


^HK  COs(whK  —  PK  —  P-H-k] 

K 


Eqs.(3.9)  and  (3.10)  imply 


(3.11) 


tan  pu 


Y^k  ^hk  sin(a;HK  -  V^k  -  ^’-h-k' 


Yk  ^hk  cos(cjhk  —  pK  —  P-h-k) 

the  system  of  SAS  tangent  equations.  For  each  fixed  H,  Eq.(3.1 1)  yields  two  values  for  pu 
differing  by  tt.  Eqs.(3.9)  and  (3.10)  serve  to  fix  the  quadrant. 
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3.4.  The  Maximal  Function  M{}p) 

One  defines  the  maximal  function  M ((/?),  a  function  of  the  phases  ip,  by  means  of 

(3.12)  M{ip)  =  :^ - - - 5^^HKCOs((^HK  -  ^hk) 

2_/h,k  jj 

and  infers  that  M{(p)  has  a  global  maximum  when  all  the  phases  appearing  in  Eq.(3.12)  are 
set  equal  to  their  true  values. 

3.5.  The  Maximal  Property  of  the  System  of  S AS  Tangent  Equations 

Fundamental  maximal  property.  Fix  H.  Assume  that  the  values  of  all  phases  other 
than  (pH  are  specified  arbitrarily.  Then  the  maximal  function  M{ip)  becomes  a  function, 
M{iph\t)^  of  the  single  phase  (^h-  As  a  function  of  ipn,  M{(Ph\p)  has  a  unique  maximum 
in  the  whole  interval  (0,  27r)  and  the  value  of  pH  that  maximizes  M{ph\p>)  is  given  by  the 
SAS  tangent  equation  (3.11). 

3.6.  Solving  the  System  of  SAS  Tangent  Equations 

Specify  arbitrarily  initial  values  for  all  the  phases  p.  Fix  H.  Calculate  a  new  value 
for  the  phase  pH  by  means  of  the  SAS  tangent  equations  (3.9)  to  (3.11),  in  this  way,  in 
view  of  §  3.5,  increasing  the  initial  value  of  the  maximal  function  M{p).  Fix  H'  H. 
Calculate  a  new  value  for  pH>,  again  using  (3.9)  to  (3.1 1),  the  new  value  for  pn.  and  initial 
values  for  the  remaining  phases,  thus  increasing  still  further  the  value  of  M[p).  Continue 
in  this  way  to  obtain  new  values  for  all  the  phases,  thus  completing  the  first  iteration  and, 
in  the  process,  continuously  increasing  the  value  of  M{p).  Complete  as  many  iterations  as 
necessary  in  order  to  secure  convergence.  Convergence  is  assured  since  the  iterative  process 
yields  a  monotonically  increasing  sequence  of  numbers,  the  values  of  M{p),  bounded  above 
by  unity.  Evidently  also,  the  process  leads  to  a  local  maximum  of  M{p)  and  a  corresponding 
set  of  values  for  all  the  phases  p  which  depends  on  the  values  of  the  phases  chosen  initially. 
By  choosing  different  starting  values  for  the  phases  one  obtains  different  solutions  for  the 
system  of  SAS  tangent  equations  and  different  local  maxima  of  M{p).  That  solution  yielding 
the  global  maximum  of  M{p)  is  the  one  we  seek. 

3.7.  The  Linear  Congruence  Connection 

The  problem  of  going  from  the  estimated  values  cjhk  of  the  three-phase  structure  in¬ 
variants  phk  to  the  values  of  the  individual  phases  p  may  be  formulated  as  the  problem  of 
solving  the  redundant  system  of  linear  congruences 

(3.13)  +  <P-Yi-K  =  (modulo  27r) 

each  with  weight  Ahk-  Our  solution  of  the  system  of  SAS  tangent  equations  also  yields  the 
solution  of  the  redundant  system  of  linear  congruences  (3.13). 
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Abstract.  Harmonic  analysis  has  been  the  longest  lasting  and  most  powerful  tool 
for  dealing  with  signals  and  systems  which  involve  both  periodic  and  transient  phe¬ 
nomena.  Further  impact  on  signal  processing  was  facilitated  by  what  we  refer  to 
as  representations  in  combined  spaces.  These  representations,  motivated  by  Gabor’s 
concept  of  time-frequency  information,  cells  have  evolved  in  recent  years  into  a  rich 
repertoire  of  Gabor-type  windowed  Fourier  transforms,  relating  Fourier  analysis  to 
the  Heisenberg  group.  Such  localized  bases  or  frames  are  useful  in  the  representation, 
processing,  compression  and  transmission  of  speech,  images  and  other  natural  signals 
that,  by  their  very  nature,  are  nonstationary.  To  incorporate  scale  that  lends  itself  to 
multiresolution  analysis  as  is  the  case  with  wavelets,  the  Gabor  scheme  is  general¬ 
ized  to  multiwindow  Gabor  frames.  The  properties  of  such  sequences  of  functions 
are  characterized  by  an  approach  that  combines  the  concept  of  frames  and  the  Zak 
Transform.  Results  on  signal  representation  and  reconstruction  from  partial  informa¬ 
tion  in  the  frequency  domain  are  related  to  and  derived,  using  relevant  results  obtained 
^  in  the  time  or  positional  information  domain.  Some  results  concerning  the  represen¬ 
tation  of  Fourier-transformed  discrete  time  (finite)  sequences  by  partial  information 
are  rederived  by  exploiting  the  duality  of  the  Fourier-Stieltjes  transform  and  its  in¬ 
verse.  Results  related  to  discrete  signals  are  extended  to  continuous  one-dimensional 
signals.  Signal  and  image  representation  by  phase  only  information  is  considered  also 
in  the  context  of  localized  (Gabor)  phase,  where  restoration  of  magnitude  by  iterative 
techniques  is  much  more  efficient  than  in  the  case  of  global  (Fourier)  phase. 
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1.  Introduction 

The  subject  of  harmonic  analysis  is  extremely  broad,  as  can  be  concluded,  for  example, 
from  the  viewpoint  of  Katzenelson  [16],  who  considers  it  as  the  study  of  objects  (functions, 
measures,  etc.)  defined  on  topological  groups.  Indeed,  groups  are  relevant  to  Gabor-type 
representations  and  wavelet-type  transforms  [25].  Consider  the  quasi-regular  representation 

(1.1)  T{g)h{x)  =  kh{g~''x), 

where  k  is  a.  constant  that  may  depend  on  g,  and  the  independent  variable  x  is  an  n-dimen- 
sional  vector. 

This  representation  that  satisfies  the  functional  homomorphism 

^(^1^2)  =  T(gi)T{g2) 

underlines  the  decomposition  of  a  function  h  into  a  package  of  waves,  i.e.,  wavelets 

{T(go)h,  •  •  •  ,  T{gN)h,  *  •  •  }. 

We  say  that  r(^)  is  a  unitary  representation  with  respect  to  a  measure  if 

(1.2)  (T(g)f,T{g)h),  =  {f,h),. 

A  wavelet-type  transform  is  defined,  according  to  the  group  theoretic  approach,  as  a  cross 
correlation  between  a  signal  f{x)  and  the  wavelets 

{T(9)h{x)} 

that  is  defined  as  follows: 

C/(5)  =  c  /  f(x)T{g)h*ix)n{x) 

Jm 

(1.3)  =  c{f,T{g)h)^, 

where  h*  stands  for  conjugate,  c  >  0  is  a  normalization  constant,  his  a.  template  function  (or 
a  mother  wavelet),  and  ii{x)  =  p{x)dx  is  an  appropriate  invariance  measure  (if  one  exists) 
under  the  action  of  the  group  G. 

Given  the  equation  for  the  analysis,  (1.3),  we  are  looking  for  a  way  to  recover  f(x)  from 
Cf{g),  i.e.  the  equation  of  the  synthesis  and  the  conditions  under  which  it  exists. 

Most  studies  of  wavelets  have  been  devoted  to  the  special  case  of  the  affine  group  G  — 
G{A~^,b)  of  planar  scaling  and  translation.  is  in  this  case  a  diagonal  matrix  with  entries 
a~^  and  b  is  the  translation  vector.  The  quasi-regular  unitary  representation  over  L^(IR”)  is 
defined  in  this  case  by 

(1.4)  T{g)f{x)  =  kf{A{x  -  6)) , 

where  k  =  \fj  and  J  =  |  det(j4)|.  Gabor-type  representations  originate  from  the  Heisenberg 
group  and  the  associated  Weyl  operational  calculus.  As  such  this  representation  lacks  the 
scaling  that  is  characteristic  of  wavelets.  Scaling  is,  of  course,  important  in  the  analysis  of 
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natural  signals.  However,  the  explicit  harmonic,  or  spectral  nature  of  Gabor-type  represen¬ 
tations  is  just  as  important.  Thus,  it  is  desirable  to  incorporate  scaling  into  the  Gabor-type 
representation.  This  is  accomplished  by  the  generalized  Gabor  scheme  [21],  [35].  In  this 
paper  we  focus  on  signal  and  image  representation  by  multi-window  Gabor-type  schemes, 
where  by  proper  choice  of  the  set  of  windows  we  incorporate  scaling  [35].  We  then  proceed 
to  present  various  aspects  of  signal  and  image  representation  by  partial  information,  mostly 
in  the  context  of  harmonic  analysis.  These  two  subjects  are  of  utmost  importance  in  signal 
processing,  and  have  proven  to  be  useful  in  a  wide  range  of  applications  (not  discussed  in 
this  paper).  The  two  interrelated  subjects  are  also  instrumental  in  gaining  insight  into  the 
structure  of  natural  signals,  and  of  images  as  such.  In  both  cases  we  discuss  the  analysis  as 
well  as  the  synthesis. 


2.  Multiwindow  Gabor-type  frames 

2.1.  Generalized  Gabor-type  schemes  and  the  Zak  transform 

Many  problems  in  physics  and  engineering  involve  the  representation  and  analysis  of 
nonstationary  signals  and  processes.  Such  problems  call  for  the  development  and  application 
of  sets  of  functions  that  are  localized  in  the  sense  that  they  rapidly  decay  in  both  time  (or 
position  in  the  case  of  images)  and  frequency.  Such  are  the  wavelets,  Gabor  functions  and 
the  Wigner  distribution. 

The  two  widely-studied  sequences  of  the  so-called  wavelets  and  Gabor  functions  are  spe¬ 
cial  cases  of  the  following  generalized  sequence: 

(2.1)  57',m,n(^)  ~ 

where  {^^(3^)}  is  a  sequence  of  window  functions,  {ur}  is  a  set  of  real  numbers  and 
is  a  set  of  kernel  functions.  In  the  classical  Gabor  case  [9],  there  is  a  single-window  function 
g{x),  with  Or  =  a  and  b  some  positive  constant,  (t>r,m{3^)  =  and  m^n  e  In  the 

wavelet-type  Gabor  case,  (/>r,m(^)  =  1,  ttr  =  a6^,  r,n  e  Z,  and  g[x) 

is  a  “mother  wavelet”  function. 

Zibulski  and  Zeevi  [35]  analyzed  such  sequences  of  functions  by  developing  a  matrix  al¬ 
gebra  approach  based  on  the  concept  of  frames  and  the  Zak  transform  (ZT).  The  basic  results 
are  presented  here  without  the  proofs.  The  idea  of  using  frames  in  order  to  examine  the  se¬ 
quence  of  Gabor  functions  (and  affine  wavelets  as  well)  was  first  proposed  and  implemented 
by  Daubechies  et  al.  [4];  the  so-called  Weyl-Heisenberg  frames.  The  ZT  of  a  signal  J{x)  is 
defined  as  follows  [14]: 

{Zf){x,  u)  =  /  ^a(x  +  fc)j 

kez 


(2.2) 


~oo  <  x,u  <  oo^ 
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with  a  fixed  parameter  a  >  0.  The  ZT  satisfies  the  following  periodic  and  quasiperiodic 
relations: 


(2.3)  {Zf){x,u  +  1)  -  {Zf){x,u),  (Zf){x-Yl,u)  =  e^^^^{Zf){x,u). 


As  a  consequence  of  these  two  relations,  the  ZT  is  completely  determined  by  its  values  over 
the  unit  square  (j;,  u)  G  ([0, 1)^).  This  is  the  essence  of  this  unitary  mapping. 

Based  on  the  ZT  defined  by  (2.2),  we  define  the  Piecewise  Zak  Transform  (PZT)  as  a 
vector- valued  function  F{x,u)  of  size  p  [34]: 


(2.4) 

where 


F{x,  u) 


Fo{x,u),Fi{x,u) 


?  Fp-i 


(2.5)  Fi{x,u)  =  (Zf)  (^x,u  +  ^  ,  0  ^  i  ^  p  -  1,  i  G  Z. 

The  vector-valued  function  F[x^u)  belongs  to  T^([0, 1)  x  [0,  l/p);C^),  which  is  a  Hilbert 
space  with  the  inner-product: 


r\  rl/p  _ 

(F,G)=  dx  duy2Fi{x,u)Gi{x,u). 
Jo  Jo 


Since  the  ZT  is  a  unitary  mapping  from  L^(IR)  to  I/^([0, 1)^),  the  PZT  is  a  unitary  mapping 
from  L^(]R)  to  I/^([0, 1)  x  [0,  l/p);€^)  .  As  a  consequence  we  obtain  the  following  inner- 
product  preserving  property: 


/OO  _  pL  pi  _ 

f(x)g(x)dx  =  /  /  (Zf){x,u){Zg)(x,u)dxdu 

OO  Jo  Jo 

p1  rl/p  _ 

=  I  dx  du\]Fi{xTu)Gi{x,u). 


This  unitary  property  of  the  PZT  allows  us  the  transformation  from  L^(IR)  to  L^([0, 1)  x 
[0,  l/p);(D^),  where  issues  regarding  Gabor-type  representations  are  often  easier  to  deal  with 
and  understand. 


2.2.  Multi-window  Gabor-type  expansions 

Generalizing  the  Gabor  scheme,  by  using  several  window  functions  instead  of  a  single- 
one,  the  representation  of  a  given  signal  f{x)  e  T^(IR)  is  given  by: 

R-l 

f  {x)  =  ^  ^  ,771, n(^)  > 

r=0  m,n 


(2.6) 
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where 

(2-7)  9r^m.n{x)  =  gr{x  ~~ 

and  {5r(2^)}  is  a  set  of  R  distinct  window  functions.  Such  a  set  can  incorporate,  for  exam¬ 
ple,  Gaussian  windows  of  various  widths.  In  this  case,  with  proper  oversampling,  one  can 
overcome  in  a  way  the  uncertainty  constraint  expressed  by  the  limitations  of  having  either  a 
high  temporal  (spatial)  resolution  and  low  frequency  resolution  or  vice  versa.  This  type  of  a 
richer  Gabor-type  representation  can  be  instrumental  in  applications  such  as  detection  of  tran¬ 
sient  signals  [8],  or  in  identification  and  recognition  of  various  prototypic  textures  and  other 
features  in  images  [22].  Clearly,  if  i?  =  1  we  obtain  the  single- window  Gabor  representation. 

The  characterization  of  the  sequence  {^r,m,n}  can  be  divided  into  three  categories  accord¬ 
ing  to  the  sampling  density  of  the  combined  space  (the  so-called  phase  space  density)  defined 
hyd=  R{ah)~^\  undersampling  -  d  <  1,  critical  sampling  -  d  =  1  and  oversampling  -  d  >  1. 

2.3.  Matrix  algebra  approach  for  the  analysis  of  frames*  properties 

In  order  to  examine  the  properties  of  the  sequence  {gr,m,n},  we  consider  the  operator: 

R-\ 

(2-8)  =  9r,m,n)gr,m,n  , 

r=0  m,n 

which  is  a  frame  operator  if  constitutes  a  frame.  For  the  single-window  Gabor 

scheme,  this  operator  was  examined  by  straightforward  methods  [5, 13],  where  application 
of  the  ZT  was  restricted  to  the  case  ab  =  1.  In  [35],  Zibulski  and  Zeevi  show  that,  if 
the  product  ab  is  a  rational  number,  it  might  be  advantageous  to  examine  this  operator  in 
L^([0, 1)  X  [0,  l/p);€^)  by  using  the  PZT.  A  major  result  of  analysis  in  the  PZT  domain  is  the 
representation  of  the  frame  operator  as  a  finite  order  matrix- valued  function,  as  formulated  in 
the  following  theorem. 

Theorem  1  (Zibulski  and  Zeevi  [35]).  Let  ab  =  pjq,  p,  ^  €  N,  and  let  be  the 
frame  operator  of  the  sequence,  which  is  the  PZT  The  action  ofS^  mX2([0,l)x 

[0,  l/p);€^)  is  given  by  the  following  matrix  algebra: 

(2.9)  {SzF){x^  u)  =  S(a;,  u)F{x^  w), 

where  S{x^u)  is  ap  x  p  matrix-valued  function  whose  entries  are  given  by: 

Si,k{x,u) 

(2.10) 

and  the  vector-valued  function  F{x,u)  is  given  by  (2.4)  and  (2.5), 


V  1  \  9  pj 


i,k  =  Q,---  ,p-l, 
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Since  the  PZT  is  a  unitary  transform,  (2.9)  is  an  isometrically  isomorphic  representation 
of  5  (2.8). 

Using  the  PZT  and  the  matrix  representation  of  the  operator  5,  Zibulski  and  Zeevi  [35] 
examined  the  properties  of  the  sequence  {gr,m,n}  for  a  rational  ab.  Since,  in  the  case  of 
undersampling  the  sequence  {gr,m,n}  is  not  complete,  the  results  presented  next  are  relevant 
in  the  cases  of  critical  sampling  and  oversampling. 

The  next  theorem  examines  the  completeness  of  the  sequence  {5r,Tn,n}  in  relation  to  the 
structure  of  the  matrix-valued  function  S(a;,  u). 

Theorem  2  (Zibulski  and  Zeevi  [35]).  Given  gr  G  L^(IR),  0  ^  r  ^  i?  -  1,  and  a 
matrix-valued  function  S(a:,  w),  (x,  u)  €  ([0, 1)  x  [0, 1/p))  as  in  (2.W),  the  sequence  {pr,m,n} 
associated  with  {gr},  ab  =  p/q,  p,  Q  e  N  is  complete  if  and  only  if  det{S){x,  u)  0  a.e.  on 
[0, 1)  X  [0, 1/p). 

The  frame  bounds  of  the  sequence  {pr,m,n}are  determined  by  the  eigenvalues  of  the 
matrix-valued  function  S(x,  u).  Let 

(2.11)  A^ax(S)  =  esssup(^^„)eQ0,i)x{0,i/p))  max  Ai(S)(j:,  u) 

(2.12)  A^in(S)  -  ess  inf(:,,„)e{[o,i)x[o,i/p))  mm^  Ai(S)(j:,  w), 

where  Ai(S)(a;,u)  are  the  eigenvalues  of  the  matrix  S{x,u).  Then,  the  upper  frame-bound 
B  Xmax{S),  and  the  lower  frame-bound  A  =  Xmin(S).  This  result  yields  the  following 
theorem. 

Theorem  3  (Zibulski  and  Zeevi  [35]). 

The  sequence  {pr,m,n}  associated  with  {pr}»  gr  ^  0  ^  r  ^  i?  —  1,  and  ab  = 

p/q,  p,  ^  G  N  constitutes  a  frame  if  and  only  if  0  <  A,niri{S)  ^  Amax(S)  <  oo. 

An  alternative  approach  to  determining  whether  {pr,m,n}  constitutes  a  frame  is  as  follows. 
The  following  Lemma  formulates  a  necessary  and  sufficient  condition  for  the  existence  of  an 
upper  frame  bound  B  <  oo. 

Lemma  1.  The  sequence  {pr,m,n}  associated  with  {pr},  Pr  ^  0  ^  r  ^  R  -  I 

and  ab  =  p/p,  p,  p  €  N,  has  an  upper  frame  bound  B  <  oo  if  and  only  if  {Zgr){x,  u)  are  all 
hounded  a.e.  on  (0, 1]^  (Zgr  G  L°°((0, 1)^)A 

Theorem  4  below  determines  whether  the  sequence  {gr,m,n)  constitutes  a  frame,  when  an 
upper  frame  bound  exists,  and  which  does  not  necessitate  calculation  of  the  eigenvalues  of 
S(a:,  u). 

Theorem  4  (Zibulski  and  Zeevi  [35]).  Given  gr  G  I/^(IR),  0  ^  r  ^  R  —  I,  such 
that  there  exists  an  upper  frame  bound  B  <  oo  for  the  sequence  {pr,m,n}  associated  with 
{pj,  and  ab  =  p/q,  p,  p  G  N.  The  sequence  {pr,m,n}  constitutes  a  frame  if  and  only  if 
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0  <  ^  det(S)(a;,  u)  a.e.  on  [0, 1)  x  [0, 1/p),  where  the  matrix-valued  function  S(a:,  u)  is 

as  in  (2.10). 

The  following  theorem  concerns  tight  frames  and  the  matrix  representation  of  the  frame 
operator.  Recall  that  a  set  of  functions  {^„}  in  a  Hilbert  space  H  constitutes  a  tight  frame  if 

n 

Theorem  5  (Zibulski  and  Zeevi  [35]).  Given  Qr  e  0  ^  r  ^  R  -  1,  and 

a  matrix-valued  function  8(3;,^)  as  in  (2.10),  the  set  of  functions  {pr,m,n}  associated  with 
{pr},  ah  =  pjq,  p,  ^  G  N  constitutes  a  tight  frame  if  and  only  ifS{x,  u)  =  A1  a.e.,  where  I 
is  the  identity  matrix,  and  A  =  ^  IbriP- 

In  order  to  complete  the  formalism  of  representation,  one  has  to  obtain  the  dual  frame. 
This  can  be  done  by  using  operator  techniques  [5, 7].  For  the  single-window  scheme  the 
dual  frame  {S~^gm,n}  is  generated  by  a  single  dual  frame  window  function  [5, 13].  This  is, 
indeed,  also  the  case  for  the  multi- window  scheme.  Let  {7r,m,n}  denote  the  dual  frame  of 
{pr,m,n} then  { 7r, m,n}  IS  generated  by  a  finite  set  of  R  dual  frame  window  functions  {7^}: 

7r,m,n(a;)  =  jr{x  ~  ,  0  ^  r  ^  i?  -  1, 

where  jr  =  Using  the  matrix  representation  (2.9)  of  the  frame  operator,  the  PZT  of 

7r,  is: 

(2.13)  w)  =  S~^(3;,  u)Gr(3;,tA), 

that  is,  Trix,  u),  Gr{x,  u)  are  vector- valued  functions  in 

L2([0,l)x[0,l/p);C") 

and  S”^(3:,  u)  is  the  inverse  of  the  matrix  S(a;,  u);  for  example, 

S~^{x,u)  =  [det(S)(a;,  w)]“^adj(S)(3:,?z). 

2.4.  The  Balian-Low  theorem  in  the  case  of  multi-windows 

In  the  case  of  a  single- window  Gabor  scheme  with  critical  sampling,  i.e.  ab  =  1,  by 
choosing  an  appropriate  g{x),  the  sequence  {pr,m,n}  can  be  complete  and  constitute  a  frame, 
notwithstanding  the  problems  of  stability  [5].  The  sequence  {pr,m,n}  can,  however,  be  com¬ 
plete  and  not  constitute  a  frame,  in  which  case  the  representation  is  unstable.  A  classical 
example  of  such  an  unstable  scheme  is  the  one  with  a  Gaussian  window  function. 

In  the  case  of  a  single  window  and  critical  sampling,  the  following  theorem  of  Balian  and 
Low  indicates  that  a  wide  range  of  well  behaved  -  rapidly  decaying  and  smooth  -  functions 
g{x)  are  excluded  from  being  proper  candidates  for  generators  of  frames. 

Theorem  6  (Balian-Low  Theorem  [2,5,6]).  .  Given  g  e  a  >  0  and  ab  =  1,  if 

the  sequence  {pm,n}  constitutes  a  frame,  then  either  xg{x)  ^  L^(]R)  or  g'{x)  ^  I/^(]R). 
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Note  that  g'{x)  €  uj9{(jo)  G  L^(IR),  where  g  is  the  Fourier  transform  of  p. 

One  of  the  solutions  for  this  problem  is  oversampling.  In  fact,  it  was  proven  that  in  the 
case  of  a  Gaussian  the  {pm,n}  constitutes  a  frame  for  all  a6  <  1  [15,26]. 

In  the  case  of  critical  sampling  of  the  multi-window  scheme,  an  interesting  question  is 
whether  one  can  overcome  the  constraint  imposed  by  the  Balian-Low  Theorem  by  utilizing 
several  windows.  According  to  the  following  theorem,  if  all  the  windows  in  the  set  {pr}  arc 
well  behaved  functions,  this  is  not  possible: 

Theorem  7  (Zibulski  and  Zeevi  [35]).  Given  gr  G  7/^(IR),  O^r^R-l,  a>0 
and  R{ab)-'^  =  1,  if  the  sequence  {pr,m,n}.  as  in  (2J),  constitutes  a  frame,  then  either 
xgr{x)  ^  L^(IR)  or  g[{x)  ^  L^(IR)  for  some  0  ^  r  ^  R  -  1. 

One  of  the  advantages  of  using  more  than  one  window  is  the  possibility  of  overcoming 
the  constraint  imposed  by  the  Balian-Low  Theorem  on  the  choice  of  window  functions,  by 
adding  an  extra  window  function  of  proper  nature  such  that  the  resultant  scheme  of  critical 
sampling  constitutes  a  frame.  Whether  one  can  find  a  non-well-behaved  window  function, 
complementary  to  a  set  of  well-behaved  window  functions,  such  that  the  inclusive  set  will 
generate  a  frame  for  critical  sampling,  depends  on  the  nature  of  the  set  of  the  well-behaved 
window  functions  as  indicated  by  the  following  proposition. 

Proposition  1.  Let  a  set,  {pj,  O^r^R  - 2,  of  R-l  window  functions  be 
given.  Denote  by  G^{x,  u)  the  R  -  1  x  R  matrix-valued  function  with  entries  u)  = 

2:gr(x,u+  ^),andP(x,u)  =  G^(x,u)G^*  (x,u).  There  exists  a  window  function  gR-i{x) 
such  that  the  inclusive  set  {gr},  0  ^  r  ^  i?  -  1  generates  a  frame  for  the  critical  sampling 
case,  if  and  only  ifO  <  K  ^  det(P)(a;,  w)  a.e,  on  [0, 1)  x  [0,  l/R)- 

An  example  of  jR  -  1  well-behaved  window  functions  satisfying  0  <  K  ^  det(P)(a:,  u) 
a.e.  on  [0, 1)  x  [0, 1  /R),  can  be  constructed  in  the  following  manner.  Take  a  window  function 
g{x)  such  that  the  sequence  {p(x  -  constitutes  a  frame  for  ab  =  R/(R  -  1). 

Note,  that  this  is  an  oversampling  scheme  (l/(a6)  =  {R-  l)/R  <  1)  and  that  there  exist, 
therefore,  well-behaved  window  functions  g{x)  such  that  {p(a:  -  constitutes  a 

frame  (for  example  the  Gaussian  function).  Construct  the  following  R-1  window  functions: 


1) 

Clearly  these  are  well-behaved  window  functions.  Moreover,  we  obtain 

R 

(Zgr){x,u)  =  (2:g){x-r-^^^,u) 

and  the  matrix-valued  function  G^{x,u)  equals  the  matrix-valued  function  G(a:,n)  which 
corresponds  to  the  sequence  {g{x  -  (which  we  denote  by  {pr, m.n})-  In  this 

case  {Pr,m,n}  Corresponds  to  an  undersampling  scheme.  By  the  duality  principle,  since 
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{g{x  —  constitutes  a  frame,  {gr,m,n}  constitutes  a  Riesz  basis  for  a  sub-space 

of  L^(IR).  It  can  therefore  be  shown  that  0  <  K  ^  det(P)(j;,  u)  a.e.  on  [0, 1)  x  [0,  l/R)- 

2.5.  A  wavelet-type  Gabor  scheme 

Consider  a  generalization  of  the  sequence  {gr,m,n} »  where  for  each  window  function 
gr[x)  there  is  a  different  set  of  parameters  ^r‘ 

(2.14)  9r,m,n{x)  =  gr{x  - 

and  the  sampling  density  of  the  combined  space  is: 

r=0 

In  the  case  of  the  single  window,  the  characterization  of  the  sequence  {gr,m,n}  can  be  divided 
into  the  three  categories  of  undersampling,  critical  sampling  and  oversampling  according  to 
d<l,  d=l,  d>l,  respectively. 

In  order  to  analyze  this  kind  of  a  scheme,  we  consider  an  equivalent  one  with  =  a,  br  = 
b  for  all  r  [35]  and  utilize  the  tools  presented  in  the  previous  section  for  the  analysis  of  the 
sequence  properties. 

Utilizing  the  degrees  of  freedom  of  choosing  a  different  set  of  parameters  Or,  br  for  each 
window  function  gr,  we  construct  a  wavelet-type  scheme.  Let  a,  /3  be  positive,  real  numbers. 
Given  a  window  function  g{x)  let 

gr(x)  =  a~’'^‘^g(a~’'x). 

Also,  let  Or  =  I3a^,  and  Orbr  =  R/d  for  all  r,  where  d  is  the  sampling  density  of  the  combined 
space.  We  then  have 

gr,m,n{x)  =  a~’'/^g{a~’'x  - 

In  this  scheme  the  width  of  the  window  is  proportional  to  the  translation  step  o^,  whereas 
the  product  Orbr  is  constant.  This  scheme  incorporates  scaling  characteristics  of  wavelets 
and  of  the  Gabor  scheme  with  the  logarithmically-distorted  frequency  axis  [21].  However,  in 
contrast  with  wavelets,  this  scheme  has  a  finite  number  of  window  functions,  i.e.,  resolution 
levels,  and  each  of  the  windows  is  modulated  by  the  infinite  set  of  functions  defined  by  the 
kernel.  Each  subset  for  fixed  m  can  be  considered  as  a  finite  (incomplete)  wavelet-type  set, 
in  that  it  obeys  the  properties  of  scaling  and  translation  of  a  complex  prototypic  “mother 
wavelef\  For  example,  if  R/d  is  an  integer,  the  mother  wavelet  is  defined  by  g{x)e’^  ; 

if  not,  this  definition  holds  within  a  complex  phase.  Thus,  all  the  functions  corresponding  to 
each  of  these  mother  wavelets  are  self-similar: 

Scaling  has  been  realized  as  an  important  property  of  sets  of  representation  functions 
which  are  used  in  the  analysis  of  natural  signals  and  images.  Hence  the  importance  of  this 
type  of  generalization  of  the  Gabor  scheme. 
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3.  Signal  representation  by  partial  information 

3.1.  Fourier  and  related  transforms 

Let  f{t)  be  a  complex  function  which  belongs  to  Li(-oo,  oo)  or  L2(-oc,  cx:).  The  ordi¬ 
nary  Fourier  transform  denoted  by  /(cj)  and  its  inverse  are  defined  by 


(3.1) 

/(w)  = 

rOO 

(3.2) 

fit)  = 

If  /(w)  is  zero  outside  the  interval  [a,  6]  then 

(3.3)  /(i)  =  ^  ^  |o|,|6|<oo, 

is  called  a  band-limited  function,  i.e,,  j{t)  G  where  is  the  space  of  bandlimited 
functions. 

It  is  well  known  from  the  Paley-Wiener  theoremm  [23]  that  such  a  f{t),  with  t  being 
interpreted  as  a  complex  variable,  is  an  entire  function  of  exponential  type  (EFET).  Due  to 
the  duality  of  the  Fourier  transform  and  its  inverse,  if  f{t)  is  time-limited  to  [a,  ^],  then 

(3.4)  f{uj)=  f  f(t)e-^'^^dt. 

J  a 

In  this  case,  with  cu  being  viewed  as  a  complex  variable,  f{uj)  is  an  EFET.  This  duality  is 
complete  if  we  change  the  argument  sign  and  multiply  (divide)  by  27r.  If  we  define  a  dual 
Fourier  transform  pair  f\t))  by 

(3.5)  f{w)  =  -  j  f'{t)e^-‘dt 

/OO 

f{uj)e-^^‘duj 

■OO 

we  have  perfect  duality  between  time-limited  signals  and  band-limited  signals.  If  f{t)  — 
f{t),  then  f{u))  =  27r/'(~a;).  Thus,  if  f{u)  is  given  by 

(3.7)  f{uj)  =  P{uj)  -f  jQ{u))  =  R{io)  exp{ii9(uj)),  R{lo)  >  0  , 

where  P((j)  is  the  real  and  Q{uj)  the  imaginary  part,  f'(oj)  is  given  by 

(3.8)  27r/'(a;)  =  f{-uj)  =  P{u})  -  jQ{oj) . 

Alternatively,  f{co)  can  be  expressed  in  polar  coordinates 

(3.9)  2nf\u)  =  R(u)  exp(-j?9(a;)) , 
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where  R[ijj)  is  the  magnitude  and  the  phase.  All  subsequent  results  relating  to  time- 
domain  (when  the  signals  are  band-limited  in  frequency)  will  be  used  with  reference  to  the 
frequency-domain  (when  the  signals  are  of  finite  duration  in  time  or  finite  extent  in  space), 
with  proper  interpretation  of  the  duality  that  exists  between  frequency  and  time  domain  rep¬ 
resentations. 

In  general,  the  signals  in  the  time  or  frequency  domain  will  not  belong  to  Li  (— oo,  oo)  or 
I/2(-oo,  oo).  We  will  use  delta  functions  to  describe  discrete  time  signals  or  discrete  Fourier 
spectra.  The  application  of  these  functions  can  be  mathematically  justified  by  introducing  the 
Fourier-Stieltjes  transform  [3],  [23], 

(3.10)  e{u)-e{o)  =  /  — df{t) 

J  —oo  3^ 

(3.11)  m-m)  - 

where  0(cj)  is  the  cumulative  spectrum  and  f(t)  the  cumulative  time  function.  The  Fourier- 
Stieltjes  integrals  are  properly  defined  even  when  the  time  or  frequency  functions  contain 
discrete  components,  while  9{ix))  and  f{t)  are  functions  of  bounded  variation  in  (-oo,  (X)). 
The  duality  of  the  Fourier-Stieltjes  transform  and  its  inverse  is  self-evident  by  their  definitions 
(3.10),  (3.11).  / (t)  is  bandlimited  in  the  Fourier-Stieltjes  sense  if  0{uj)  is  constant  outside  an 
interval  {a,b].  In  a  dual  manner  we  define  time-limited  signals.  In  cases  where  9{uj)  is 
unbounded  in  its  variation,  more  general  results  should  be  used,  such  as  Zakai’s  definition  of 
band-  (or  time-)  limited  signals  [32],  or  generalized  functions  [10].  An  important  result  due 
to  Polya’s  theorem  [17]  or  Paley-Wiener-Schwartz  [10]  ensures  that  if  a  signal  is  bandlimited 
in  the  Fourier-Stieltjes  sense  in  one  domain,  its  counterpart  is  an  EFET. 

A  discrete  complex  time  signal  is  defined  by 

(3.12)  x{t)  =  ^  x„S{t  -  t„), 

n 

and  its  Fourier  transform  by 

(3.13)  x(uj)  =  . 

n 

Using  a  transformation  of  the  whole  complex  plane  into  itself  {u  and  ^  being  viewed  as 
complex  variables), 

(3.14) 

the  generalized  ^^-transform  is  defined  by 

(3.15)  x{z)  =  . 

n 
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The  ordinary  Fourier  sequence  and  ^-transforms  are  derived  through  uniformly  spaced  (nor¬ 
malized)  sampling  =  n.  The  same  (within  the  duality)  definitions  apply  to  cases  where  a 
discrete  signal  describes  the  spectral  function. 

We  use  properties  of  a  class  of  EF^T  called  B-functions.  These  include  all  functions  of  fi¬ 
nite  bandwidth  (according  to  various  definitions  of  bandwidth).  The  properties  of  B-functions 
are  well  established  by  classical  theory  of  entire  functions,  and  a  complete  characterization  of 
their  zeros  is  known.  Furthermore,  for  each  B-function  there  corresponds  a  unique  expansion 
into  the  product  of  its  zeros,  determined  by  the  Hadamard  factorization  [23]: 

(3.16)  f{s)  =  lim  TT  (1  -  s/s„) , 

r— >oo  ^ 

\sn%r 

where  f{s)  is  a  B-function,  c  a  complex  constant,  m  an  integer,  k  a  real  constant,  and  s„ 
are  the  complex  roots  of  f{s).  Convergence  of  (3.16)  is  conditional  on  correct  order  of  the 
zeros  [23].  This  form  of  expansion  shows  that  a  band-limited  function  is  uniquely  determined 
(within  some  parameters)  by  all  zeros. 

3.2.  Time  and  position  (Le.  2D)  domains  -  zero  crossings 

Logan’s  work  [17]  addresses  the  issue  of  whether  a  one-dimensional  bandpass  signal 
can  be  uniquely  determined  by  its  real  zero  crossings.  His  work  characterizes  analytically 
the  class  of  these  interesting  signals,  and  proves  that  a  bandpass  signal  can  be  uniquely  de¬ 
termined  by  its  real  zero  crossings  (within  a  multiplicative  constant)  even  if  it  has  complex 
zeros.  Logan’s  existence  theorem  defines  the  conditions  that  must  be  satisfied  in  order  to  have 
a  representation:  1.  The  bandpass  function  and  its  Hilbert  transform  should  have  no  zeros  in 
common  other  than  real  simple  zeros,  and  2.  The  bandwidth  of  the  signal  must  be  less  than 
an  octave.  These  common  zeros  of  the  bandpass  signal  and  its  Hilbert  transform  (free  zeros 
in  Logan’s  terminology)  are  those  that  may  be  removed  or  moved  around  without  destroying 
the  bandpass  property  of  the  signal. 

The  formal  conditions  for  a  real  bandpass  signal  representation  by  its  zero  crossings  are 
stated  in  Theorem  8,  and  for  real  periodic  function,  as  a  special  case,  in  Theorem  9. 

Theorem  8  (Real  Bandpass  Signal  Version  (Logan  [17])). 

Let  h{t)  =  Re{f{t)e^‘^^^)  be  a  real  bandpass  function,  where  f{t)  is  bandlimited  and 
bounded.  h{t)  is  uniquely  determined  by  its  real  zero-crossings  within  a  multiplicative  con¬ 
stant,  if  h{t)  has  no  zeros  in  common  with  its  Hilbert  transform  h{t)  =  Im(f  (free 

zeros)  other  than  real  simple  zeros,  and  if  the  band  is  less  than  one  octave  in  width. 

Theorem  9  (Real  Periodic  Functions  (Poggio  et  al.  [20])). 

Let  hs  (t)  be  a  real  periodic  bandpass  function 

N 

n=-N 


(3.17) 
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where  Cn  is  a  discrete  complex  sequence,  and  *  denotes  the  conjugate  operation.  The  function 
hg  {t)  is  uniquely  determined  by  its  real  zeros  within  a  multiplicative  constant  provided  hg  (t) 
and  its  Hilbert  transform  hg  (f),  given  by 

N 

(3.18)  hs{t)  =  ^2  ^  sign{n)cn , 

n=—N 

have  no  zeros  in  common  except  real  simple  zeros,  and  c„  =  0  for  |nl  N/2  (or  {N  + 
l)/2  when  N  is  odd ). 

It  should  be  noted  that  these  are  existence  theorems  and  as  such  they  define  the  necessary 
and  sufficient  conditions.  However,  they  do  not  provide  an  algorithm  for  signal  reconstruction 
from  its  zero  crossings,  nor  do  they  address  the  important  issue  of  stability.  Algorithms  for 
one-dimensional  signals  were  put  forward  by  Voelecker  and  Requicha  [31],  These  algorithms 
were  modified  by  Rotem  and  Zeevi  [24],  and  applied  in  reconstruction  of  images  from  their 
zero  crossings. 

Theorem  10  (Bandpass  Signals  in  Two  Dimensions). 

( Rotem  and  Zeevi  [24])  Let  f{x,y)  be  a  real  function  that  satisfies  the  following  conditions: 

(i)  I 

(ii)  f^{x,yo)dx  <  oo  for  a  certain  y^  (such  y^  must  exist  due  to  (i)). 

(iii)  y)dy  <  oofor  {a^n}  defined  below  (see  (vii)). 

(iv)  The  two-dimensional  Fourier  transform  off{x,  y) 

oo 

(3.19)  />,  v)  =  11  fix,  y)e-^('‘-+‘'y^dx  dy 


(3.21) 


0<a<5<2a;  0<c<d<2c. 


(v)  For  yo  in  (ii),  the  one-dimensional  function  of  x,  f(x^yf),  and  its  Hilbert  transform 
f{x,yo)  do  not  have  zeros  in  common  except  real  zeros  of  degree  one. 

(vi)  For  {a^n}  in  (iii),  the  one-dimensional  function  in  y,  /(x„,  y),  and  its  Hilbert  transform 
f{xn,  y)  do  not  have  zeros  in  common  except  real  zeros  of  degree  one. 

(vii)  The  set  of  points  {a;„}  constitutes  a  sampling  set  for  B(a,  b)  by  having  density  greater 
than  [h  —  a)/7r.  More  precisely:  if  N{L)  is  the  number  of  sampling  points  on  the 
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interval  (0,  L),  then  the  sufficient  condition  is 

(3.22)  lim  sup  ^  ^  . 

L — >oo  'JT 

Then,  if  there  exists  another  function  g{x,  y)  which  also  satisfies  conditions  (i)-(vii), 
and  if 

(3.23)  sign{g(x,  y))  =  sign{f{x,  y)), 
we  have 


(3.24)  g(x,y)  =  afix,y) . 

That  is,  /  and  g  are  identical  within  a  multiple  constant.  The  proof  is  a  direct  extension 
of  Logan’s  theorem  [17].  Since  the  one-dimensional  functions  f{xn,  y)  and  /(x,  yo)  satisfy 
Logan’s  conditions,  we  can  construct  the  functions  along  vertical  lines  x  =  a;„,  scale  them 
according  to  the  function  on  the  horizontal  line  yo,  and  then,  having  a  sampling  set  {a^n},  the 
value  of  f{x,  y)  is  determined  (up  to  a  constant)  at  every  point  in  the  plane.  Obviously,  x  and 
y  can  be  interchanged  in  conditions  (ii)-(vii)  and  in  the  proof  [24]. 


Theorem  11  (Bandpass  in  One  Dimension,  Lowpass  in  Other). 

(Rotem  and  Zeevi  [24])  Let  f(x,  y)  be  a  real  function  that  satisfies  the  following  conditions: 

W  /  ST;y:,P{^,y)dxdy  <  00. 

('9  /  !Poo  Pi^’  yn)dx  <  CO  for  {;/„}  defined  below. 

(iii)  The  two-dimensional  Fourier  transform  of  f(x,  y)  satisfies 

N  ^  (  u  < -bU -a  <  u  <  aUb  <  u 

(3.25)  f{u,  i/)  =  0  ^  ^ 

where 


(3.26)  0<a<6<2a;  0<c<oo. 

(iv)  For  {yn}  in  (ii),  the  one-dimensional  function  in  x,  /(x,  ?/„)  and  its  Hilbert  transform 
fi^^yn)  do  not  have  zeros  in  common  except  real  zeros  of  degree  one. 

(v)  There  exists  a  straight  line  £  of  angle  0  to  the  x  axis,  where 

2a  —  b 

(3.27)  tan  0  <  — - — 

3c 

(by  (iii),  b  <  2a,  so  0  >  0)  such  that  the  one-dimensional  function  f{s)  satisfies  along 

I 

/oo 

f{s)ds  <  oo . 

-oo 

(vi)  The  function  f{s)  and  its  Hilbert  transform  f{s)  do  not  have  zeros  in  common  except 
real  zeros  of  degree  one. 
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(vii)  The  set  of  points  {?/„}  constitutes  a  sampling  set  for  B(c). 

Then,  if  there  exists  another  function  g{x,y)  which  also  satisfies  conditions  (i)-(vii),  and  if 

(3.29)  sign{g{x,  y))  =  sign{f(x,  y)), 
we  have 

(3.30)  g{x,  y)  =  af{x,  y) . 

In  this  case,  the  function  is  determined  on  horizontal  lines  pn  and  scaled  on  the  diagonal 
i.  Having  {pn},  a  sampling  set  for  B(c),  we  can  determine  the  value  for  every  point  between 
the  lines. 

The  conditions  can  be  interchanged  so  that  the  function  is  determined  on  multiple  diago¬ 
nals  and  one  horizontal  line,  or  another  diagonal  angle  less  than  0  relative  to  the  x  axis. 

The  importance  of  formulating  the  conditions  of  free  zeros  in  the  case  of  a  two-dimen¬ 
sional  signal  (e.g.  an  image),  is  in  the  observation  that  these  conditions  have  to  be  satisfied 
only  on  a  sparse  set  of  lines  in  the  plane.  Furthermore,  if  the  signal  is  periodic  (and  every  finite 
image  can  be  considered  as  one  period  of  a  periodic,  infinite  signal,  thus  enabling  bandpass 
conditions  to  exist)  -  only  a  finite  number  of  lines,  out  of  the  infinite  number  of  lines  in  the 
plane,  have  to  satisfy  the  free-zero  conditions.  Thus,  even  if  the  strict  condition  of  having  no 
free  zero  is  not  realized  in  an  image  which  satisfies  one  of  the  bandpass  conditions,  the  image 
is  still  likely  to  be  represented  by  its  zero  crossings. 

3.2.1.  Reconstruction  of  images  from  their  zero  crossings.  The  proposed  algorithms  directly 
follow  the  proofs  of  Theorems  10  and  1 1.  If  the  image  satisfies  the  conditions  of  Theorem  10, 
the  one-dimensional  functions  on  several  rows  (or  columns)  are  reconstructed,  using  an  iter¬ 
ative  algorithm  described  below.  Then,  another  one-dimensional  function  is  reconstructed 
along  a  column  (or  a  row),  and  used  for  scaling  the  previous  functions.  If  the  image  satisfies 
the  conditions  of  Theorem  1 1,  the  scaling  is  done  along  a  diagonal  at  the  proper  angle.  The 
rest  of  the  image  is  reconstructed  by  interpolation. 

The  reconstruction  algorithm  is  as  follows:  Given  the  clipped  signal  5,  pass  it  through 
an  ideal  bandpass  filter  to  yield  Sq.  Designating  the  bandpass  operation  as  obtain  the 

initial  signal 

(3.31)  So  =  Bp{signS} 
and  then  continue  iteratively 

(3.32)  Sj+i  ==  Sn-  c[Hp{sign  5n}  -  ^o] . 

The  algorithm  converges  when  sign(5'i)  =  sign  (S').  It  can  be  shown  that  the  process  of  these 
successive  approximations  is  contracting.  If  Logan’s  conditions  for  uniqueness  are  met,  then 
Sn  =  aS,  where  a  is  some  constant.  This  algorithm  yields  good  results  for  most  bandpass 
signals,  with  the  number  of  iterations  varying  from  a  few  up  to  100.  If  sign{Si{nT))  = 
sign{5'(nf))  for  every  n,  there  can  still  exist  another  continuous  signal  S'{t)  where  S'{nT)  = 
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Si{nT)  but  sign(5'(t))  ^  sign(5'(t)),  so  S'[t)  ^  S{t)  and  also  Si{nT)  ^  aS{nT),  This 
ambiguity  can  be  viewed  as  a  result  of  real-zeros  jitter  that  may  also  change  the  location  of 
the  complex  zeros  [24].  It  should  be  noted,  however,  that  the  dependence  of  the  deviation  of 
the  complex  zeros  on  that  of  the  reals  zeros  is  closely  related  to  the  question  of  stability.  This 
issue  is  dealt  with  elsewhere  [33]. 

The  error  analysis  indicates  some  simple  ways  to  reduce  the  errors  in  the  scaling  factors. 
For  example,  select  a  column  along  which  the  reconstructed  signal  converged;  locate  the 
maxima  of  the  signal.  (On  these  points  the  normalized  errors  are  minimal.)  Reconstruct  the 
signals  along  some  rows  intersecting  the  selected  column  at  the  peak  points  and  scale  them 
according  to  the  peak  values.  Then  reconstruct  the  signals  in  the  other  columns.  The  inter¬ 
section  points  of  each  column  with  the  rows  form  two  vectors:  that  of  the  (as  yet  unsealed) 
values  of  the  signal  along  the  column  and  that  of  the  scaled  values  of  the  signals  along  the 
rows.  Denote  the  unsealed  values  along  the  column  by  c{k)  and  the  values  of  the  rows  by 
r[k).  If  there  are  M  rows,  then  the  optimal  scaling  factor  for  the  column  is 


(3.33) 


In  this  way  the  effect  of  reconstruction  errors  in  the  columns  and  the  rows  tends  to  cancel 
out  [24]. 

The  same  method  can  be  applied  to  images  which  are  bandpass  in  one  dimension  and 
lowpass  in  the  other  (Theorem  11).  The  scaling  is  carried  out  in  this  case  by  reconstruction 
on  several  diagonals  intersecting  one  column  at  its  maxima,  scaling  the  diagonals,  and  then 
scaling  the  other  columns  by  values  of  the  points  of  intersection  with  the  diagonals. 

In  order  to  have  an  ideal  bandpass  (or  lowpass)  image,  the  signal  should  be  looked  upon 
as  periodic,  the  actual  image  being  one  period  (in  the  two-dimensional  sense).  The  conditions 
of  finite  Fourier  integrals  should  in  this  case  be  modified  for  periodic  signals. 

The  bandpass  operation  can  be  carried  out  by  calculating  the  FFT  of  each  column  and 
simply  deleting  all  components  out  of  the  passband,  and  then  transposing  the  image  and 
repeating  the  operation.  This  is  equivalent  to  convolving  the  image  with  an  infinite,  periodic 
bandpass  filter. 

Prior  to  performing  the  sign  operation,  it  is  recommended  to  interpolate  the  signal  so  as  to 
reduce  the  zero-crossings  jitter.  Interpolation  is  also  needed  in  creating  the  diagonals,  so  that 
they  can  be  inclined  at  an  angle  of  less  than  45°.  In  order  to  create  a  bandpass  signal  along 
the  diagonal,  the  signal  should  also  be  periodic.  This  is  done  by  continuing  the  diagonal 
along  some  periods  of  the  image  until  it  reaches  the  starting  point.  Because  of  the  assumed 
periodicity  of  the  image,  continuing  the  diagonal  means  “folding”  from  bottom  to  top  each 
time  it  reaches  the  last  row. 
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3.3.  Frequency  domain  -  discrete  signals 

We  now  present  theorems  concerning  one-dimensional  discrete  signal  representation  by 
partial  information  in  the  frequency  domain.  These  include  the  cases  of  phase  only,  sampled 
phase,  one  bit  of  phase,  magnitude,  and  signed-magnitude.  The  theorems  are  quoted  as  stated 
in  the  original  papers. 

Theorem  12  (Hayes,  Lim  and  Oppenheim  [12]).  Let  x{n)y  y{n)  be  two  finite  length 
real  sequences  whose  z-transforms  have  no  zeros  in  reciprocal  conjugate  pairs  or  on  the  unit 
circle.  If  'd^{u)  —  dy[u))  for  all  u,  then  a:(n)  —  Py{n)  for  some  positive  constant  If 
tan  ^{(jo)  =  tan  'Oy{uj)  for  all  u,  then  x{n)  —  fiy{n)for  some  real  constant  fi. 

and  'dy{(jj)  are  the  respective  phases  of  the  Fourier  sequences  ofx{n)  and  y{n). 

The  results  have  been  extended  to  complex  sequences  and  the  condition  prohibiting  sim¬ 
ple  zeros  on  the  unit  circle  relaxed. 

Theorem  13  (Hayes  [11]).  Let  x{n),  y[n)  be  complex  sequences  which  are  zero  outside 
the  interval  [0,  N  —  1],  with  z-transforms  having  no  zeros  in  conjugate  reciprocal  pairs  or 
on  the  unit  circle.  If  M  ^  2N  —  1  and  —  dy{(jS)  at  M  distinct  frequencies  in  the 

interval  [0, 27r],  then  x{n)  —  fiy{n)  for  some  positive  real  number  fi.  If  M  2N  —  1 
and  tan(?5^(ct;))  =  tan(??y{a;))  at  M  distinct  frequencies  in  the  interval  [0,  27r],  then  x(n)  = 
/3y(n)  for  some  real  number  /?. 

Theorem  14  (Hayes,  Lim  and  Oppenheim  [12]).  Let  x(n)  be  a  real  sequence  which  is 
zero  outside  the  interval  [0,  W  —  1]  with  x{0)  ^  0  and  \vhich  has  a  z -transform  with  no  zeros 
in  conjugate  reciprocal  pairs  or  on  the  unit  circle.  Let  y[n)  be  any  real  sequence  which  is 
also  zero  outside  the  interval  [0,  N  —  1].  Ifij^i^)  =  at  N  —  1  distinct  frequencies  in  the 
interval  (0,  tt),  then  y{n)  =  px[n)  for  some  positive  constant  p.  Iftm{'dx{oj))  =  tan(i9y 
for  N-1  distinct  frequencies  in  the  interval  (0,  tt),  then  y{n)  =  fix{n)  for  some  real  constant 

Theorem  15  (Hayes,  Lim  and  Oppenheim  [12]).  Let  x{n)  and  y{n)  be  two  real  sequen¬ 
ces  whose  z-transform  contain  no  reciprocal  pole-zero  pairs  and  which  have  all  poles,  not  at 
z  —  oo,  inside  the  unit  circle.  If  the  magnitude  of  the  Fourier  transforms  of  x{n)  and  y(n) 
are  equal,  then  x{n)  —  ±y(n  -f  m)  for  some  integer  m. 

Theorem  16  (Thm.  2  of  Van  Hove,  Hayes,  Lim,  Oppenheim). 

[29]  Let  x{n)  and  y[n)  be  two  real,  causal  (or  anticausal),  and  finite  extent  sequences  with 
z-transforms  which  have  no  zeros  on  the  unit  circle.  If  the  signed  Fourier  magnitudes  of  the 
sequences  (denoted  by  and  Gy[u),  respectively,  and  defined  by  (3.34)  and  (3.35))  are 

equal  for  all  oj,  G^{oj)  =  Gy{u))  0  <  a  <  ir,  then  x(n)  ~  y{n).  When  a  =  77  (or  0),  and  if 
Gl{uj)  —  G*y{uj)  for  all  u)  and  a;(0)  =  2/(0),  then  x{n)  =  y{n). 

The  signed  Fourier  magnitude  G^{uj)  and  Fourier  magnitude  Rx{uj)  are  related  through 
(3.34)  G^(u;)  =  5“(a;)|i2,(a;)|, 
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where  the  bipolar  function  defined  by 


(3.35) 


f  +1  a  -  TT  ^  <  a 

I  -1  otherwise 


incorporates  one  bit  of  phase  information  at  each  frequency  [19],  [29]. 

Theorem  17  (Oppenheim,  Lim,  and  Curtis  [19]).  Let  x{n),y{n)  be  two  real,  finite  ex¬ 
tent,  causal  (or  anticausal)  sequences  with  all  zeros  outside  (inside)  the  unit  circle.  If 

S^ioj)  =  S^iu),foralluj, 

q;  =  0,  TT  then  x{n)  =  fiy{n),  where  fi  is  a  scale  factor.  (For  a  =  7r/2,  is  a  positive  factor.) 


4.  Duality  of  results  in  time  and  frequency  domains 

Based  on  the  duality  of  the  Fourier-Stieltjes  transform  and  its  inverse,  relevant  well- 
known  results  for  time  domain  problems,  concerned  with  unique  representation  of  bandlim- 
ited  functions  by  partial  information,  were  applied  by  Shitz  and  Zeevi  [27]  in  order  to  de¬ 
rive  results  about  unique  representation  of  a  Fourier  transformed  discrete  or  continuous  time 
signal  by  partial  information  in  the  Fourier  domain.  This  approach  not  only  enables  the 
derivation  of  new  results  for  the  continuous  signals,  but  more  importantly,  also  highlights  the 
interrelationship  of  the  problems  in  the  two  domains. 

Shitz  and  Zeevi  [27]  have  derived,  and  in  various  cases  extended  the  results  concerning 
signal  representation  by  partial  information  in  the  frequency  domain  by  application  of  Lo¬ 
gan’s  results  (Theorems  8  and  9),  and  those  of  Voelecker  [30].  In  the  sequel  we  only  outline 
their  approach. 

Let  x{t)  be  a  complex,  finite  time  signal;  x{t)  equal  to  zero  outside  the  interval  [0,  T).  Let 
^(a;)  be  the  Fourier  transform  of  x{t),  assumed  to  exist  at  least  in  the  Fourier-Stieltjes  sense. 


Then 

roo 

r'^ 

(4.1) 

11 

O) 

1 

e 

cC 

li 

J -oo  J 

0 

(4.2) 

x{t)  =  /  x(L0)e^^^duj 

27r  7_oo 

(4.3) 

j(cj)  :=  Px{(-^)  +  jQx{oj) 

=  7i!^(a;)  exp(i79a:(w)) 

R^{uj)  >  0, 

where  Qx{^),Rx(^),  and  i9a;(6j)  are,  respectively,  the  real  part,  imaginary  part,  mag¬ 

nitude,  and  phase  of  ^(w). 
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4.1.  Signal  representation  by  its  Fourier  phase 

Theorem  18  (Shitz  and  Zeevi  [27]).  Let  rc(t)  be  a  complex  finite-time  signal  the  Four¬ 
ier  transform  of  which  x{u)  has  no  complex  conjugate  zeros.  The  signal  x{t)  is  uniquely 
determined  (within  a  positive  multiplicative  constant)  by  its  Fourier  phase  Vo;.  lfx[v) 

has  no  conjugate  zeros  at  all  (neither  complex  nor  real),  then  x{t)  is  uniquely  defined  (within 
a  real  constant)  by  tan(^?a;(a;)). 

This  version  of  Theorem  12  is  valid  for  continuous  signals.  Its  discrete  version,  obtained 
by  using  the  discrete  model 

(4.4)  x{n)  -  ^  XnS{t  -tn)  0  ^  <  T 

n 

and  by  interpreting  the  free  zeros  (see  Theorem  8)  as  reciprocal  conjugate  zeros  (in  the  z- 
plane),  turns  out  to  be  an  extended  version  of  Theorem  12.  In  fact,  x(n)  may  under  this 
generalization  be  a  countable  sequence  provided  0  <  <  T  for  all  n  (in  may,  for  example, 

be  a  convergent  sequence)  and  En|rrn|  <  oo.  This  result  is  evident  due  to  the  entireness  of 
x{i')  which  contains  a  countable  but  not  necessarily  finite  number  of  zeros  (3.16).  (Theo¬ 
rem  12  is  obtained  by  choosing  T  =  N^tn  —  N  and  0  <  n  <  N). 

Theorem  2  of  Hayes,  Lim,  and  Oppenheim  [12],  which  is  the  no-poles-in-reciprocal-pairs 
version  of  Theorem  12,  follows  the  latter  and  can  be  derived  by  using  an  anticausal  signal 
x{t)  ^  0  for  —T  <  f  ^  0,  or  alternatively  by  using  the  transformation  2:  =  instead  of 

An  alternative  and  much  simpler  derivation  of  Theorem  12  is  possible  by  application  of 
Theorem  9  which  already  deals  with  discrete  signals  (trigonometric  sequences).  However, 
under  this  approach  the  obtained  theorem  is  restricted  to  discrete  signals. 

4.2.  Signal  representation  by  its  sampled  Fourier  phase 

Consider  a  finite  extent  complex  sequence  dn,  which  equals  zero  outside  the  interval 
[0,  A  -  1],  The  discrete  Fourier  transform  d{Lu)  of  is  given  by 

iV-l 

d{uj)  = 

n=0 

(4.5)  d{uj)  =  P^{u))+jQ^{uj)  =  R^(u)exp(jd^{u)) 

where  P^{(jj),Q^{uj),R^{u)),  and  are  the  real  part,  imaginary  part,  magnitude,  and 

phase  of  d{u).  The  transform  is  extended  to  the  whole  complex  plane  by  replacing  u  with 
U  =  CJ  +  ju. 

Similarly  to  our  consideration  of  the  continuous  case,  we  define  a  new  sequence  a„  which 
is  time  constrained,  or  “bandpass  in  time”. 

(4.6)  an  =  0.5(d„_M  4-  d^n-u)  -^  >  0- 
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Its  Fourier  sequence 

(4.7) 

A/+yv-i 

d(i/)  =  E 

n^-{M+N-l) 

is  alternatively  written  as 

the  sum  of  two  sequences 

(4.8) 

a{u)  =  M&{i')  +  Na{u) 

defined  by 

(4.9) 

-M 

n=-M-(7V~l) 

(4.10) 

Na{u) 

M+{N~\) 

n=M 

(4.11) 

d(i/) 

=  Re{d{u)e~^''^)  =  R^(y)  cos{Mp  - 

Q;(a;)  is  real  since  —  d*_^. 

The  Hilbert  transform  g(i/)  of  a{i')  is  defined  by 

M+{N~1) 

(4.12)  a{iy)  =  ^  ,  an  =  i  sign{n)an 

(4.13)  q(i')  =  R^(i/)sm{Mi'  -  d^iiy)) . 

Now  we  can  apply  Theorem  9  [20].  The  one  octave  bandwidth  requirement  is  satisfied  by 
choosing  M  >  {N  -  1).  We  note  that  a(-i/)  is  equivalent  to  h{t)  in  the  notation  of  [20]. 
Interpreting  this  work  in  the  context  of  our  own  notation,  it  was  shown  that  if  Uq  is  a  free  zero 
of  q;(“Z/),  then  both  this  zero  and  its  conjugate  v^q  are  zeros  of  Na(jy)  [20,  eq.  (6)]: 

(4.14)  N^(iyo)  =  N^{iy*o)  =  0. 

Derivation  of  Theorem  13  is  straightforward,  by  application  of  the  formalism  provided  in 
[20].  Accordingly  q;1(z^)  is  the  Fourier  sequence  of  another  time  bandpass  sequence  aln 
which  is  zero  for  |n|  ^  [M,  M  +  (A  -  1)]  and  defined  similarly  to  (4.6),  where  is  replaced 
by  dl„.  It  was  shown  in  [20]  that 

a(w)a(i/)l(aj) -g(i/)l(a))dl(w)  = 

N-1 

-(N-i) 


(4.15) 
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This  equation  can  be  interpreted  as  an  appropriate  convolution  of  two  complex  sequences  of 
length  N,  one  of  them  being  causal  and  the  other  anticausal. 

It  was  shown  in  [20]  that  the  Vandermonde  determinant  associated  with  the  real  roots  of 
(4.15)  is  not  equal  to  zero  if  Logan’s  conditions  are  satisfied,  so  if  for  at  least 

2N  -1  distinct  frequencies  in  the  interval  (0, 27r)  then  Cn  in  (4.15)  all  equal  zero.  Therefore 
sin(i^j(a;)  -  ~  ^  ^  equivalently,  tan ^^^^‘(cj)  =  tan for  all  lj. 

Logan’s  condition  prohibiting  complex  free  zeros,  when  expressed  in  the  context  of  2:, 
dictates  a  no  reciprocal  conjugate  zeros  condition,  but  still  allows  zeros  on  the  unit  circle.  By 
the  fact  that  tan  19^2(0;)  =  tan  and  through  Theorem  12  we  have  dn  =  l^dln  for  some 

real  /?,  and  Theorem  13  follows.  It  is  emphasized,  though,  that  in  this  case  zeros  on  the  unit 
circle  are  not  allowed  except  for  the  case  when  the  points  oji  at  which  is  specified  are 

all  the  real  zeros  of  q:(u;),  as  discussed  in  the  preceding  subsection. 

Theorem  5  of  Hayes,  Lim,  and  Oppenheim  [12]  is  a  special  case  of  Theorem  13  consid¬ 
ering  a  real  sequence  The  phase  ?9j(0)  at  a;  =  0  is  always  zero.  It  is  defined  in  this  case 
for  at  least  N  distinct  frequencies  over  the  interval  [0,  tt),  provided  M  ^  N  —  1.  Theorem  4 
of  [12],  which  is  equivalent  to  Theorem  3  of  [12],  can  be  derived  in  the  same  way,  using 
anticausal  sequences  or  the  complex  transformation  2:  = 

4.3.  Derivationof  Theorem  14 

Theorem  14  in  its  complex  version  is  derived  by  directly  applying  Theorem  9.  Through 
(4.15)  we  reach  the  following  equality:  if  i9^(c(;)  =  ^^^^(cc;),  at  least  for  2N  —  1  distinct 
frequencies  in  the  interval  (0, 27r),  then 

a(a>)  _  &l(a)) 

(4‘  lb)  ^  \  ''  1 1  \ 

a[uj)  »1(^) 

(4.17)  tan  d^{oj)  =  tan  (ct;)Va; . 

Since  a{u)  satisfies  Logan’s  conditions,  it  can  be  uniquely  determined  within  a  real  constant. 
If  al{u)  also  does  not  possess  forbidden  free  zeros  (in  Logan’s  sense),  the  theorem  is  proved 
immediately.  If,  however,  it  does  possess  such  free  zeros,  complex,  or  real,  multiple  zeros, 
then  we  can  identify  d!l(tj)  within  a  real  function  b{u;)  which  is  positive  or  negative  for  all  w, 
i.e., 

(4.18)  Q;l(a;)  =  6(61;) Q!(a;) . 

^From  (4.18)  we  thus  realize  that  is  a  convolution  between  an  and  some  conjugate  sym¬ 
metric  sequence  so  that  al^  cannot  be  time-bandpass  in  the  same  band  as  a„,  |n|  e 
[M,  M  -f  (V  —  1)].  This  is  possible  only  when  6(0;)  ~  j3,  p  being  a  real  constant.  Again,  if 
the  equality  refers  to  ^?j(u;)  =  d^^{u)  for  distinct  2N  —  1  frequencies  in  (0, 27r),  the  signum 
function  of  6:1(0;)  and  6(0;)  is  uniquely  defined,  so  that  6(0;)  in  (4.18)  can  be  only  a  positive 
function  and  the  multiplication  factor  p  is  therefore  positive.  It  should  be  emphasized  that 
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free  zeros  on  the  real  axis  of  a(a;)  are  not  permitted  according  to  this  theorem,  except  for  the 
case  when  the  instants  coi,  at  which  is  specified,  are  all  real  zero  crossing  instants  of 

q;(lj)  (because  for  zero  crossing  Logan  permits  simple  free  zeros  to  occur).  Theorem  14  is 
merely  the  real  version  of  the  theorem  derived  here,  restricted  to  real  sequences. 

Theorems  6  and  7  (in  Hayes  et  al.  [12])  are  dual  of  Theorem  14  here  for  pole-only  se¬ 
quences.  They  can  be  derived  using  the  methods  implemented  in  the  case  of  the  dual  Theo¬ 
rems  12  and  13  for  pole-only  sequences. 

4.4.  Signal  representation  by  Fourier  magnitude 

We  turn  back  to  a  continuous  causal  case  where  a:(i)  is  defined  by  (4.2)  and  x{uj)  is  an 
analytic  signal  expressed  by 

X{U))  =  P:,{U))  +  =  Pxiyj)  - 

(4.19)  =  /?^(a;)exp(j?93,(w)),  R^{u})  >  {) . 

This  is  an  analytic  representation  of  a  lower  sideband  signal  in  the  frequency  domain,  with 
Px{u))  being  real.  Equivalently,  (27r)~^f  (-cj)  is  an  analytic  representation  of  an  upper  side¬ 
band  [30],  [18]. 

The  question  in  which  cases  (27r)“^^(— u;)  [or  equivalently,  x (a;)]  is  determined  uniquely 
by  its  magnitude  is  equivalent  to  the  problem  of  determining  when  a  single-side-band  (SSB) 
signal  can  be  unambiguously  demodulated  (within  a  sign  factor)  by  an  amplitude  modulation 
(AM)  detector.  We  apply  Voelecker’s  [30]  results  directly  to  (2it)~'^x{-(j),  and  extend  The¬ 
orem  15  to  the  continuous  case.  According  to  Voelecker,  x{~j/)  [or  x{u)]  cannot  have  zeros 
in  either  the  upper  half  plane  (UHP)  or  lower  half  plane  (LHP).  Substituting  z/  =  a;  -f-  ju,  we 
have  that  li  >  0  and  w  <  0  [or  vice  versa  with  respect  to  x(i/)].  Accordingly,  we  have  for 
continuous  signals  the  following  theorem. 

Theorem  19  (Shamai  and  Zeevi  [27]).  Let  x{t)  be  a  complex  finite-time  signal  whose 
Fourier  transform  x{u)  possesses  no  zeros  either  in  the  lower  half-plane  (LHP)  or  in  the 
upper  half-plane  (UHP).  Then  x[t)  is  uniquely  determined  by  the  Fourier  magnitude  Rx{(^) 
within  a  constant  time  shift  and  a  complex  phase  factor 

Representing  a  discrete  time  signal  by  (4.4)  and  applying  the  complex  transformation 
z  =  we  derive  the  all  zero  version  of  Theorem  15  for  complex  sequences.  The  zeros  at  the 
origin  of  m  being  an  integer,  are  permitted  since  they  form  pure  delays.  Because  these 
delays  cannot  be  identified  by  the  magnitude,  we  have  the  ambiguity  a:(n)  =  ±y(n  +  m). 
The  formal  derivation  which  includes  terms  of  is  obtained  directly  by  defining  x{t)  to 
be  zero  outside  an  interval  (To,  T),  where  T  >  To  >  0.  The  pole-zero  version  of  Theorem  15 
is  derived  by  resorting  to  the  same  reasoning.  However,  x{t)  should  in  this  case  be  defined  as 
a  convolution  between  two  appropriate  causal  (all-zero)  sequences  one  of  which  has  only  the 
zero  terms  of  Theorem  15  and  the  other  zeros  which  are  the  reciprocals  of  the  poles  defined 
in  that  theorem.  The  new  signal  is  also  an  analytic  signal,  and  Theorem  15  follows. 
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Since  Theorem  8  of  Hayes,  Lim,  and  Oppenheim  [12]  is  just  a  mirror  version  of  Theo¬ 
rem  15,  it  can  be  derived  by  the  same  arguments  using  the  lower  half-plane  instead  of  the 
upper  one  or  vice  versa, 

4.5.  Signal  representation  by  Fourier  signed-magnitude 

Applying  well-known  results,  obtained  in  the  time  domain  for  continuous  band-limited 
functions,  Shamai  and  Zeevi  [27]  derived  the  following  extended  version  of  Theorem  16. 

Theorem  20  (Shamai  and  Zeevi  [27]). 

Let  x{t)  be  a  complex  signal  that  vanishes  outside  the  interval  [0,  T)  with  a;(0)  being  real. 
The  signal  x{t)  is  uniquely  determined  (within  a;(0)  for  a  =  O^tt)  by  the  Fourier  signed- 
magnitude  for  any  0  ^  a  ^  n,  provided  the  Fourier  transform  x{v)  contains  no  multiple  real 
zeros. 

4.6.  Signal  representation  by  one  bit  of  Fourier  phase 

By  direct  application  of  Logan’s  results  [17],  a  continuous  version  of  Theorem  17,  gener¬ 
alized  to  also  incorporate  complex  signals,  was  derived. 

Theorem  21  (Shamai  and  Zeevi  [27]). 

Let  x{t)  be  a  complex  signal  that  vanishes  outside  the  interval  [0,  T),  with  a;{0)  being  real. 
The  signal  x{t)  is  uniquely  determined  (within  a  positive  multiplicative  constant  for  a  0,  tt, 
or  within  a  real  constant  and  within  a;(0)  for  a  =  0,7r)  the  one  bit  of  Fourier  phase, 
provided  the  Fourier  transform  x{v)  is  zero-free  in  the  closed  upper  half -plane. 

Using  the  analytic  signal 

(4.20)  =  P2,(y)exp(j(tfa;(i')-a)), 

/(t)  in  Logan’s  notation  becomes  equivalent  to  (27r)“^x(— in  our  case,  be- 
cause  the  Fourier  transform  (not  the  inverse)  of  this  function  is  zero  outside  the  interval 
(-T/2,  T/2).  According  to  Logan’s  theorem  in  the  context  of  our  notations 

has  only  real  simple  zeros  if  (27r)“^  is  zero-free  in  the  closed  lower  hoif- 

plane  (z^  =  cj  +  ju,  u  <  0). 

P^{u)  is  an  EFET  which  is  real  on  the  axis  u  =  co  (in  fact  it  is  a  B-function)  and  it 
determines  x{t)  except  perhaps  for  a;(0).  If  the  zeros  of  x{u)  are  all  in  the  closed  upper 
half-plane,  has  only  simple  real  zeros  and  it  is  of  course  determined  within  a 

multiplicative  constant  by  The  latter,  in  turn,  determines  the  real  zero  crossings  of 

QJ(i/)  (utilized  in  the  preceding  subsection).  determines  eventually  the  real  zero 

crossings  of  In  fact,  the  multiplicative  constant  is  positive  unless  a  =  0,  tt  since,  as 
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was  mentioned  before,  S^(iy)  determines  not  only  the  zero  crossing  instances  of 

but  also  its  sign.  In  the  case  of  a  =  7r/2,  also  determines  x{0)  which  is  assumed  in 

our  theorem  to  be  true. 

Interpretation  of  these  results  for  the  discrete  version  is  straightforward,  by  defining  x{t) 
according  to  (4.4).  The  conditions  with  respect  to  the  2:  variable  are  found  by  using  z  = 
Since  u  =  u  ju  and  w  ^  0,  all  the  zeros  in  z-notations  have  to  be  outside  or  on  the  unit 
circle;  those  on  the  unit  circle  must  be  simple  zeros. 

The  extended  version  of  Theorem  21  applies  to  complex  sequences,  and  permits  simple 
zeros  on  the  unit  circle;  it  is  valid  also  for  the  generalized  z-transform,  since  the  results 
are  valid  for  complex  [except  for  a;(0)],  continuous  causal,  and  finite  extent  signals.  The 
anticausal  case  can  be  derived  by  the  same  procedure,  using  anticausal  finite  extent  complex 
and  continuous  signals  instead  of  x{t),  or  alternatively,  in  the  case  of  discrete  signals,  by 
applying  the  transformation  z  = 

A.l.  Signal  and  image  reconstruction  from  localized  phase 

The  subject  of  one-dimensional  signal  and  image  representation  by  phase  information 
was  extended  to  localized  phase.  Porat  and  Zeevi  [21]  represented  images  by  Gabor  phase 
and  showed  that  such  representations  by  partial  information  preserve  edge  information  of  the 
original  image.  Their  work  was  further  extended  to  restoration  of  magnitude  by  constrained 
iterative  techniques  [1].  Images  (and  other  signals)  can  be  reconstructed  from  localized  (Ga- 
bor-type)  phase  more  efficiently  than  from  global  (Fourier)  phase.  One  straightforward  sav¬ 
ing  is  in  the  number  of  computational  operations  (i.e.  computational  complexity).  The  other 
reason  is  the  much  faster  convergence  rate  of  the  iterative  algorithm  in  the  case  of  localized 
phase  [1].  The  reason  for  a  faster  convergence  rate  remained  obscured  until  recently,  when 
Urieli,  Porat  and  Cohen  [28]  proved  the  following  theorem: 

Theorem  22  (Urieli,  Porat  and  Cohen  [28]). 

Let  L  and  K  be  two  convex  sets  whose  intersection  contains  the  solution  of  the  POCS  (pro¬ 
jection  onto  convex  sets)  algorithm  of  iterative  orthogonal  projections  between  the  two  convex 
sets. 

Given  a  discrete  signal  x{n)  =  [xq,  Xi,  •  •  •  ,  Xn-i]^,  the  optimal  convergence  angle  a  = 
7r/4  is  obtained  if  and  only  if  the  x{n)  is  a  geometric  sequence,  i.e., 

a;(n) 

where  c  and  q  are  scalars.  For  all  other  signals  a  <  7r/4  (i.e.  slower  convergence). 

How  does  Theorem  22  relate  to  the  convergence  rate  reported  in  [1]?  As  Urieli  et  al.  [28] 
observed,  if  a  signal  x  is  sequentially  bisected  into  smaller  segments,  the  ratio  of  monotonic 
to  nonmonotic  sequences  increases.  Since  smaller  segments  are  more  likely  to  be  monotonic 
than  larger  segments,  they  are  likely  to  more  closely  correspond  to  geometric  sequences. 
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This  finding  provides  an  additional  good  reason  for  further  investigation  of  natural  sig¬ 
nals’  representation  in  combined  spaces,  such  as  the  Gabor-type  and  wavelet  representations. 
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Abstract.  Most  of  the  “extremal  problems”  of  Harmonic  (or  Fourier)  Analysis 
which  emerged  before  the  year  2000  were  actually  bom  in  the  twentieth  century,  and 
their  emergences  were  scattered  throughout  that  century,  including  the  two  world  war 
periods.  A  great  many  of  these  problems  pertain  to  polynomials,  trigonometric  poly¬ 
nomials  and  (finite)  exponential  sums.  Writing  a  reasonably  complete  monograph  on 
this  huge  subject  (even  if  we  choose  to  restrict  it  to  polynomials  only)  would  be  a 
monumental  task,  although  the  literature  does  indeed  contain  some  valuable  mono¬ 
graphs  on  various  aspects  of  the  subject.  The  present  text  just  touches  upon  a  number 
of  extremal  problems  on  polynomials  and  trigonometric  polynomials,  with  the  hope  of 
expanding  this  same  text  in  the  near  future  to  a  much  larger  version,  and  ultimately  to 
a  “reasonably  complete”  monograph  (but  only  with  the  help  of  other  mathematicians.) 

The  theory  of  polynomials  on  the  unit  circle  is,  of  course,  part  of  classical  Fourier 
Analysis,  studied  with  the  tools  of  real  and  complex  analysis.  But  it  also  leads  to 
studying  polynomials  on  the  (cyclic)  finite  subgroups  of  the  unit  circle,  and  this  is  part 
of  Fourier  Analysis  on  finite  groups.  In  many  ways  this  leads  to  cyclotomy,  which  is 
part  of  Number  Theory  and  Algebra.  Also,  some  combinatorial  designs  (cyclic  dif¬ 
ference  sets)  show  up  in  connection  with  this  study.  Thus  the  analysis  of  polynomials 
and  trigonometric  polynomials,  even  in  one  single  variable,  is  at  the  crossroad  of  many 
important  areas  of  contemporary  mathematics.  It  is  also  much  connected  with  some 
areas  of  engineering,  such  as  signal  processing. 
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1.  Introduction 


Whatever  is  worth  doing  is  worth  doing  badly 
Gilbert  Keith  Chesterton 

Everybody  writes,  nobody  reads 
Paul  Erdos 


I  have  borrowed  from  Z.  A.  Melzak’s  excellent  book  “Companion  to  Concrete  Mathematics” 
(volume  II)  [26]  the  first  of  the  above  two  epigraphs:  ''Whatever  is  worth  doing  is  worth  doing 
badiy\  Indeed  this  sentence  is  a  most  relevant  epigraph  for  the  present  paper,  and  even  more 
so  for  any  attempt  to  write  a  fairly  complete  monograph  on  the  topics  touched  upon  in  this 
paper:  I  shall  explain  this  in  some  detail  in  Section  1.2  (on  the  “peculiarities  and  aims  of  this 
paper”).  The  second  epigraph:  "Everybody  writes,  nobody  reads''  is  a  frightening  truth  so 
concisely  enunciated  by  the  great  Paul  Erdos  (1913-1996),  and  seems  to  be  an  explanation 
for  a  good  many  of  the  evils  that  are  infecting  current  science  research,  whether  pure  or 
applied.  More  about  this  later. 


1.1.  A  little  history 

Let  me  start  with  brief  historical  remarks  concerning  (only  the  origins  of)  the  subject: 
“extremal  problems  on  polynomials  and  trigonometric  polynomials:  a  century  of  progress”, 
as  this  was  the  (much  too  ambitious)  initial  title  I  had  given  the  organizers  of  this  ASI,  before 
toning  it  down  to  the  present  title. 

To  the  best  of  my  knowledge  (at  the  time  this  paper  is  being  written,  i.e.,  in  the  first 
week  of  November  2000),  the  subject  of  "extremal  problems  on  polynomials  and  trigono¬ 
metric  polynomials",  or  at  least  the  analytic  theory  of  this  subject,  seems  to  find  its  main 
roots  in  two  (somewhat  distinct)  fertile  grounds:  on  one  hand  the  nineteenth  century  the¬ 
ory  of  trigonometric  and  Fourier  series,  and  on  the  other  hand  the  nineteenth  century  theory 
of  approximation  and  interpolation.  The  discrete  aspects  of  the  subject  are  much  related 
to  number  theory  and  can  even  be  tracked  down  (somewhat  loosely)  to  the  time  of  Gauss 
and  Lagrange.  But  the  century-old  golden  age  of  the  subject  seems  to  have  really  started 
around  1900  (precisely  in  1889  and  in  1911)  with  two  totally  independent  major  results:  first 
Markov's  inequality,  proved  in  1887  in  a  very  special  case  by  the  chemist  Mendeleiev  [27] 
and  in  1889  by  his  mathematician  friend  A.  A.  Markov  [25]  in  the  general  case,  and  then 
Bernstein's  inequality  (proved  in  1911  by  S.  N.  Bernstein  [3]  in  a  somewhat  weaker  form 
than  what  is  presently  known  as  “Bernstein’s  inequality”.) 

Markov’s  inequality  (in  its  modem  formulation)  says  that 


(1.1) 


Bahman  Saffari  /Some  polynomial  extremal  problems  which  emerged  in. . . 


203 


whenever  P(X)  =  ^  ^["^1  ^  polynomial  with  complex  coefficients,  P^{X)  is 

its  derivative,  and 

(1.2)  ||P||l_i.i]:^_m^jP(:r)|. 

(1.1)  is  an  equality  if  we  take  P{X)  =  T„(X)  where  the  Chebishev  (or  Tchebyshev)  poly¬ 
nomial  Tn(X)  is  defined  by  cosnu  —  T„(cosw).  Markov’s  inequality  had  been  proved  in 
the  special  case  n  =  2  as  early  as  1887  by  the  chemist  Mendeleiev  who  studied  it  in  con¬ 
nection  with  a  problem  on  substances  dissolved  in  a  liquid  [27].  He  then  mentioned  it  to 
A.  A.  Markov  who  became  very  interested  and  proved  [25]  the  general  case  of  (1.1)  shortly 
after  Mendeleiev’s  work  [27].  For  more  historical  details,  see  for  example  [31]  or  [32]. 

Unlike  Markov’s  inequality  (which,  as  we  saw,  originated  from  Mendeleiev’s  research  in 
chemistry),  Bernstein* s  inequality  originated  from  pure  mathematics  (approximation  theory). 
Indeed,  at  the  beginning  of  the  twentieth  century,  the  Belgian  mathematician  De  la  Vallee 
Poussin  [11]  asked  whether  any  piecewise  linear  continuous  function  defined  on  a  compact 
interval  of  E  could  be  approximated  by  polynomials  of  degree  n  with  a  (uniform)  error  o(l/n) 
as  n  — >■  oo.  With  the  less  drastic  error  0(l/n),  the  (affirmative)  answer  had  been  given  by 
De  la  Vallee  Poussin  himself.  In  a  celebrated  memoir  (which  was  awarded  a  prize  by  the 
Royal  Academy  of  Belgium  on  15  December  1911),  S.  N.  Bernstein  [3]  answered  De  la 
Vallee  Poussin’s  question  negatively:  He  proved  that  the  best  (uniform)  approximation  of  the 
function  |a:|  on  [-1, 1]  by  a  polynomial  of  degree  2n,  (n  >  1),  lies  between  •  2^  and 
f  ’  2^’  Bernstein’s  proof  of  this  theorem  heavily  uses  the  following  inequality  which  he 
proves  in  the  same  memoir  [3]:  Whenever  P(X)  =  ^ 

(1.3)  |P'(l)|  <  (-1  <  I  <  1). 

Actually,  Bernstein  [3]  proved  (1.3)  only  for  P{X)  e  EfX],  but  the  extension  of  (1.3)  to 
P(X)  e  CIX]  is  straightforward:  Indeed  if  P(A:)  e  C[X],  putQ(a;)  ^P{x))  for  -1  ^ 
X  Choose  a  e  E  so  that  e*“P'(rc)\/l  "  attains  the  maximum  \\P'{x)\/l  - 
at,  say,  x  =  Xq.  Then 

||P'(x)%/1-2;2||(_j_i,  =  e^P'{xn)sJ\-xl 

=  ^  n||Q||[-i,i]  <  ral|P|l[-i,i), 

as  claimed. 

Putting  ll/lloo  ;=  maxteE  |/(^)|  where  f{t)  is  areal  trigonometric  polynomial 

n 

m=Ao  +  {Ak  cos  kt  4*  Bk  sin  A;t), 

*=1 


(1.4) 
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the  inequality 

(1.5)  wrwoo  ^  n|i/iu 

(where  equality  holds  if  and  only  if  f{t)  =  A  cos  nt  +  Bsmnt)  is  nowadays  known  as 
'‘Bernstein's  inequality''  although  Bernstein  never  proved  this  result.  Let  us  explain  this.  The 
change  of  variable  x  =  cos  t  shows  that  any  real  cosine  polynomial 

n 

(1.6)  f{t)  ~  Ak  cos  kt 

k=0 

can  be  written  in  the  form  f{t)  =  P(cosf)  with  P{X)  e  R[X]  as  in  (1.3),  and  vice  versa. 
Thus,  apart  from  the  equality  case,  (1.5)  is  indeed  equivalent  to  (1.3)  in  the  case  when  f(t) 
is  a  cosine  polynomial  (1.6).  So  Bernstein  [3]  did  state  and  prove  (1.5)  for  all  cosine  poly¬ 
nomials.  Then  by  a  quite  complicated  (although  very  interesting)  argument  he  showed  that 
from  the  truth  of  (1.5)  for  all  cosine  polynomials  one  can  deduce  the  truth  of  (1.5)  for  all  sine 
polynomials  of  the  form 

n 

(1.7)  f{t)  ^BA:SinA:L 

Jk=l 

Actually,  his  argument,  as  presented  in  [3],  had  a  gap  that  Bernstein  did  correct  in  his 
famous  book  [4]  published  fourteen  years  later,  in  1926. 

Now,  having  proved  (1.5)  for  all  cosine  and  sine  polynomials,  Bernstein  could  only  con¬ 
clude  that  in  the  general  case  of  “mixed”  real-valued  trigonometric  polynomials  (1.4)  one 
has 

(1.8)  ll/'lloo  ^  2n||/|U. 

This  is  not  as  good  as  (1.5),  and  equality  in  (1.8)  never  holds  unless  f(t)  =  0. 

Who  supplied  the  first  proof  of  “Bernstein’s  inequality”  (1.5)  for  “mixed”  trigonometric 
polynomials  (1.4)?  In  his  1926  book  [4],  Bernstein  stated  that  the  first  proof  was  supplied  by 
E.  Landaw  [20]  in  a  personal  letter  he  sent  Bernstein  shortly  after  the  publication  of  Bern¬ 
stein’s  original  memoir  [3].  In  the  book  [4]  Bernstein  did  give  Landau’s  proof,  which  consists 
of  a  simple  and  elegant  argument  showing  that  the  truth  of  (1.5)  for  all  sine  polynomials  (1.7) 
in  fact  implies  the  truth  of  (1.5)  for  all  “mixed”  polynomials  (1.4).  Thus  Bernstein  had  done 
most  of  the  hard  work  by  proving  (1.5)  for  all  sine  polynomials,  but  had  missed  Landau’s 
simple  argument  leading  to  the  general  case!  (Such  things  happen  quite  often.) 

So  Landau’s  1912  proof  of  “Bernstein’s  inequality”  (1.5)  remained  unpublished  until 
1926,  when  Bernstein’s  book  [4]  appeared.  However,  in  a  1914  paper  devoted  to  conjugate 
trigonometric  series,  Fej6r  [15]  states  the  truth  of  Bernstein’s  inequality  (1.5)  in  the  general 
case,  and  gives  a  very  important  application  of  it.  Fejer’s  proof  of  Bernstein’s  inequality  ap¬ 
peared  two  years  later,  in  1916,  in  a  paper  of  Fekete  [16]  who  indeed  does  attribute  the  proof 
to  Fejer.  It  is  thus  plausible  that  the  first  proofs  of  “Bernstein’s  inequality”  (1.5)  for  all  real 
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“mixed”  trigonometric  polynomials  (1.4)  were  found  independently  by  Landau  [20]  and  by 
Fejer  [15],  at  about  the  same  time,  and  that  both  of  them  communicated  their  proofs  privately 
to  fellow  mathematicians  (Bernstein  and  Fekete,  respectively.)  However,  note  that  in  the  same 
year  1914  when  Fejer’s  paper  [15]  appeared,  the  brothers  M.  Riesz  [35]  and  F.  Riesz  Riesz, 
F.  [34]  published  two  new  proofs  of  Bernstein’s  inequality  (1.5),  entirely  different  from  each 
other  and  entirely  different  from  those  of  Landau  [20]  and  of  Fejer  [15]. 

Nowadays  there  are  numerous  generalizations,  extensions  and  refinements  of  (1.5)  (some 
really  profound,  others  less),  but  most  of  which  are  simply  called  “Bernstein’s  inequality” 
or  “Bernstein  type  inequality”.  A  big  treatise  would  not  suffice  to  present  all  of  them.  I 
recall  that  about  twenty  years  ago,  around  1981,  the  late  S.  K.  Pichorides  (1940-1992)  and 
myself  thought  of  writing  a  fairly  complete  monograph  just  on  the  sup-norm  versions  of  the 
Markov-Bemstein  type  inequalities,  and  in  view  of  the  enormity  of  the  literature,  we  gave  up. 

This  being  said,  what  is  the  “most  classical”  statement  (or,  if  we  prefer,  the  “most  widely 
known”  statement  of  “Bernstein’s  inequality”?  In  other  words,  if  we  ask  a  random  mathe¬ 
matical  analyst  to  tell  us  quickly  what  is  meant  by  “Bernstein’s  inequality”,  what  is  he/she 
likely  to  reply  off  the  top  of  his/her  head?  In  my  opinion  there  are  three  such  “likely”  state¬ 
ments.  The  first  is  the  above  inequality  (1.5)  for  all  real  trigonometric  polynomials  (1.4).  The 
second  is  the  extension  of  this  same  (1.5)  to  all  complex-valued  trigonometric  polynomials  of 
the  form 

n 

(1.9)  f{t)  =  ^  (ttfe  G  C  for  all  A;  =  -n, . . . ,  n) 

k=—n 

(with  equality  in  (1.5)  now  reached  if  and  only  if  f{t)  =  ae*”*  +  where  a  and  b  are 
complex  constants.)  The  third  is  the  following:  If 

n 

F(X)  =  ;^atX*€C[X]  and  ||P|U  :=  max |P(e«)| , 

jt=0  ^ 

then 

(1.10)  llP'lloo  <  n||P|U 
(with  equality  in  (1.10)  if  and  only  if  P{X)  —  aX^.) 

Now,  are  the  above  three  statements  of  “the  classical  Bernstein  inequality”  equivalent! 
(By  “equivalent”  I  mean,  in  the  present  context,  that  each  can  be  deduced  from  the  others  in 
a  quite  easy  manner.  Otherwise  I  am  aware  that  any  two  true  mathematical  statements  are 
equivalent!)  The  answer  is  that  the  first  two  are  indeed  equivalent,  but  that  (1.10)  is  a  weaker 
result.  Indeed,  the  first  statement  is  an  obvious  special  case  of  the  second,  but  the  second  is 
also  a  straightforward  consequence  of  the  first.  (Here  is  a  simple  proof  from  G.  G.  Lorentz’s 
book  [23],  by  the  same  argument  as  the  one  presented  right  after  (1.3)  in  this  introduction, 
but  I  do  not  know  who  gave  it  first:  With  f{t)  of  the  form  (1.9),  select  a  G  R  so  that 
attains  the  value- ||/'||oo,  say  for  f  =  Now  g{t)  :=  §^(e^^/(t))  is  of  the  form  (1.4),  so 
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||/'||co  =  e“/'(^o)  =  g'iu)  ^  llsiloo  <  ll/lloo-  The  equality  case  is  ignored  in  [23],  but  this 
is  not  hard  either.)  Now  the  third  statement  (1.10)  is  obviously  a  consequence  of  the  second, 
and  it  is  also  (easily)  derived  directly  from  the  first  on  page  45  of  Bernstein’s  book  [4].  Yet 
I  do  not  know  of  any  simple  way  of  deriving  the  first  or  second  statement  from  (1.10).  On 
the  other  hand  (1.10),  as  well  as  its  extensions  to  all  LP  norms  with  p  ^  1,  have  direct  proofs 
much  simpler  than  any  of  those  available  for  the  first  two  statements.  The  reader  can  work 
them  out  herself  or  look  them  up, /or  example,  in  [32].  Here  is  yet  another  way  of  seeing  that 
the  first  two  statements  (1.5)  are  stronger  than  (1.10).  Indeed  they  imply 

(1.11)  |PV‘)l<^(l|P||oo  +  |P(e“)l) 

(as  easily  checked,  or  see  for  example  [17]),  and  (1.11)  is  obviously  a  deeper  form  of  (1.10). 

Does  Markov’s  inequality  (1.1)  compare,  depth-wise,  to  any  (“classical”  form  of)  Bern¬ 
stein’s  inequality,  although  it  was  discovered  utterly  independently  and  twenty-two  years 
earlier,  as  we  saw?  Using  Bernstein’s  original  theorem  (1.3)  and  a  nice  theorem  of  Schur 
(Theorem  6,  Chapter  3  of  Lorentz’s  book  [23]),  we  see  that  Markov’s  inequality  is  a  corol¬ 
lary  to  (1.3)  and  thus  to  (1.5)  for  all  cosine  polynomials,  but  modulo  Schur’s  theorem  which 
is  not  trivial  (although  not  very  difficult  either.)  Altogether  we  may  rate  Markov’s  inequality 
(1.1)  at  about  the  same  level  of  depth  as  Bernstein’s  inequality  (1.5)  for  all  trigonometric 
polynomials  of  the  form  (1.4)  or  (1.9). 

The  above  inequalities  of  Markov  and  Bernstein  (together  with  their  equality  cases)  are 
typical  examples  of  extremal  problems,  as  they  are  equivalent  to  the  problems  of  finding  the 
maximum  of  ||P'||[_i,i]/||P||(_i,i|  and  of  ||/'||oo/||/|U  where  degP  <  n  and  deg/  ^  n, 
(P  ^  0  and  /  ^  0).  I  chose  (the  earliest  versions  of)  the  Markov-Bemstein  inequalities  as 
typical  illustrations  of  extremal  problems  for  two  main  reasons:  1)  They  were  among  the  first 
(but  certainly  not  the  very  first)  that  marked  the  beginning  of  the  century-old  “golden  age” 
(twentieth  century.)  2)  They  have  been  the 'starting  point  of  an  enormous  literature  concerning 
their  generalizations,  extensions,  refinements  and  applications.  Their  usefulness  in  numerous 
areas  of  mathematics,  physics  and  engineering  has  been  extraordinary,  ever  since  they  were 
discovered.  See /or  example  [1],  [8],  [32]  and  many  other  good  references. 

This  being  said,  there  are  at  least  two  different  meanings  to  the  expression  “extremal 
problem”: 

1.  One  meaning  is  obviously  the  search  for  (attained  or  unattained,  global  or  local) 
suprema  and/or  infima  of  real-valued  functions  defined  on  a  set  (of  functions,  of  poly¬ 
nomials,  of  numbers,  of  measures,  etc.).  With  this  first  acceptation,  there  is  a  consid¬ 
erable  overlap  between  the  subject  of  “extremal  problems”  and  that  of  “inequalities” 
(or  rather  ^'optimal  inequalities”). 

2.  Another  one  is  a  problem  pertaining  to  properties  of  such  suprema  and  infima  (whether 
local  or  global,  etc.),  for  example  that  of  their  distribution.  This  will  not  be  discussed 
in  this  paper,  but  hopefully  will  be  in  [39]. 
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The  Bemstein-Markov  inequalities  are  typical  examples  of  polynomial  extremal  problems 
which  find  their  roots  in  the  nineteenth  century  theory  of  approximation  and  interpolation. 
But  the  majority  of  twentieth  century  polynomial  extremal  problems  find  their  roots  in  the 
nineteenth  century  theory  of  trigonometric  and  Fourier  series.  We  will  (briefly)  mention 
a  few  examples  in  this  paper.  The  reader  should,  however,  first  read  the  next  section  on 
“peculiarities”  of  this  paper. 

1,2.  Peculiarities  and  aims  of  this  paper 

I  first  explain  in  a  page  or  two  why  this  paper  is  a  quite  peculiar  one.  Its  aims  will  then 
follow  logically. 

In  May  2000,  a  few  weeks  before  this  July  2000  ASI,  I  realized  that  my  original  title 
^'Extremal  problems  on  polynomials  and  trigonometric  polynomials — a  century  of  progress'' 
was  too  ambitious  and  unrealistic  for  a  2^ -hour  tutorial  lecture.  So  I  toned  it  down  to  the 
present  title.  As  for  my  oral  lecture,  I  personally  found  it  unsatisfactory,  but  in  view  of  the 
audience’s  enthusiasm  it  seemed  successful.  I  do  not  consider  this  to  be  my  personal  success 
but  rather  the  merit  of  this  delightful  subject  itself,  where  the  problems  and  theorems  are 
most  often  simple  to  state  but  the  proofs  are  sometimes  very  hard  and  profound,  sometimes 
easy  and  straightforward.  I  hope  to  return  to  this  “instability  phenomenon”  of  mathematical 
statements  (with  its  sometimes  very  sad  consequences)  either  at  the  end  of  this  paper,  or  more 
likely  in  [39]. 

I  intended  to  write  up  my  lecture  for  the  ASI  proceedings  but  a  major  illness  and  accident 
in  my  family  left  me  no  free  time  in  August  and  September  of  2000.  Thus  I  missed  the 
submission  deadline  and  decided  to  write  instead  a  much  longer  version,  perhaps  a  whole 
monograph.  But  J.-P.  Kahane  (who  was  my  teacher  and  whose  judgment  I  respect)  pointed 
out  that  a  monograph  on  such  a  huge  subject  might  take  years  for  a  single  person,  and  in 
addition  the  author  might  be  tempted  to  stop  the  write-up  and  work  on  the  fascinating  open 
problems  of  this  subject.  So  Kahane  advised  me  to  request  a  deadline  extension  from  the 
editor  and  try  to  write  the  ASI  paper  anyway. 

As  the  editor  granted  such  extension,  I  could  use  a  week  or  so  of  peace,  in  the  house  of  a 
friend  in  Newport,  Rhode  Island,  to  produce  the  present  paper  at  the  beginning  of  November 
2000.  So  this  text  was  written  within  a  very  short  period,  when  I  had  no  access  to  a  library 
nor  to  my  personal  notes  and  documents  either  (as  I  had  left  nearly  all  of  them  in  Europe), 
and  also  the  editor  had  imposed  a  30-page  bound  on  the  length  of  the  paper.  Thus  I  had  no 
choice  but  to  write  a  “highly  imperfect”  paper,  with  the  following  features  (some  of  which 
the  reader  might  interpret  as  the  “aims”  of  this  paper): 

1.  A  number  of  topics  are  touched  upon  but  all  of  them  very  briefly,  and  most  of  them 
quite  superficially.  But  I  intend  to  write  soon  an  extended  version  [39]  of  this  paper. 

2.  All  of  the  topics  are  among  those  I  had  wished  to  mention  anyway,  but  the  “choice” 
of  them  (in  this  paper)  is  nowhere  near  any  optimal  order  of  priority.  The  choice  of 
topics  here  is  by  no  means  the  result  of  thoughtful  considerations,  but  rather  the  result 
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of  time  limitations  only:  To  write  this  paper  in  the  short  alloted  time,  I  had  to  write 
down  whatever  would  come  to  my  mind  after  some  very  quick  thinking,  in  a  nearly 
random  way.  Had  I  written  this  paper  one  week  earlier  or  one  week  later,  the  choice 
of  topics  would  have  been  quite  different.  However,  in  the  extended  version  [39],  not 
only  the  same  topics  will  be  treated  in  more  detail  and  depth,  but  also  other  topics  will 
be  added. 

3.  Although  many  nice  topics  are  not  even  mentioned  here,  all  those  touched  upon  in  this 
paper  are  subjects  that,  I  for  one,  do  consider  extremely  interesting.  All  of  them  can 
be  described  as  “Erdds-style  harmonic  analysis”.  Proofs  are  omitted  (save  for  a  couple 
exceptions),  due  to  time  and  space  limitations.  Many  of  the  topics  mentioned  concern 
my  own  results  and  conjectures,  usually  unpublished  (and  even  unwritten.)  This  is 
again  simply  due  to  time  limitations:  After  all,  my  own  results  (whether  written  or 
not)  where  the  most  accessible  ones  under  such  “space-time”  pressure! 

4.  No  systematic  history  of  any  of  the  topics  mentioned  will  be  presented  in  this  paper, 
but  hopefully  this  will  be  done  (to  some  extent)  in  the  extended  version  [39],  at  least 
for  some  of  the  topics.  However,  a  number  of  historical  points  or  anecdotes  will  be 
encountered  here.  One  of  my  personal  obsessions  is  historical  accuracy  and  honesty, 
although  I  am  not  always  successful  at  that.  It  is  a  very  unfortunate  fact  that  his¬ 
torical  accuracy  and  honesty  is  not  widespread  among  mathematicians,  whether  due 
to  ignorance,  negligence  or  malevolence.  Authors  very  frequently  reproduce,  in  their 
publications,  historical  errors  or  lies  they  have  read  elsewhere,  without  checking  any¬ 
thing,  thus  perpetuating  errors  or  lies.  Such  sad  things  are  often  unintentional,  but  too 
often  intentional  also.  I  have  had  my  fair  share  of  this  kind  of  historical  negligence, 
but  it  was  always  unintentional. 

5.  What  was  just  said  about  historical  points  can  also  be  said  about  references,  and  the 
way  erroneous  or  inappropriate  references  are  perpetuated  and  carried  from  publication 
to  publication.  Despite  my  good  intentions  in  this  respect,  I  cannot  guarantee  that  all 
the  references  in  this  paper  are  correct,  in  view  of  the  circumstances  under  which  this 
paper  was  written.  Quite  often,  I  do  not  even  give  any  reference  at  all  to  a  result  I  am 
quoting  only  by  relying  on  my  memory,  as  I  do  not  have  the  reference  on  my  desk  at 
the  moment  of  writing.  (This  is  the  case  even  with  my  own  papers!) 

6.  In  conclusion,  this  paper  is  not  to  be  viewed  as  a  survey  of  extremal  problems,  even  in 
a  very  limited  sense.  Its  main  function  is  to  serve  as  a  “memorandum”  for  myself  with 
a  view  to  the  extended  version  [39].  The  reader’s  criticisms,  comments  (and  whatever 
information  he/she  could  give  me)  are  welcome  and  will  be  gratefully  acknowledged. 

2.  Back  to  Bernstein’s  inequality 

Two  of  the  most  useful  refinements  of  Bernstein’s  inequality  are  Szego’s  inequality  [42] 
proved  in  1928  and  the  Schaake-van  der  Corput  inequality  [40]  proved  in  1935.  The  latter 
was  quickly  noted  to  be  a  weaker  form  (and  a  corollary)  of  Szego’s  inequality,  but  has  a 
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usefulness  of  its  own.  Both  are  among  the  very  few  results  that  I  will  prove  in  this  paper. 
These  proofs  will  not  be  those  of  the  original  authors,  as  they  were  quite  complicated,  but 
proofs  of  my  own  (as  they  are  the  simplest  ones  I  know  of.) 

2. 1 .  Statement  of  the  Schaake-van  der  Corput  inequality 

Theorem  2.1  (Schaake  &  van  der  Corput  [40]). 

For  any  real  trigonometric  polynomial 

n 

(2.1)  / (t)  =  ^0  +  ^  {Ak  cos  kt  +  Bk  sin  kt) 

k=l 

we  have,  for  all  t  the  following  refinement  of  Bernstein's  inequality  (1.5): 

(2.2)  U\t)f^n^f{t)f^n^f\t. 

In  other  words,  we  have  the  identity 

(2-3)  ll/''  +  nV^lloc  =  n^||/|lL 

Remarks  on  the  equality  case  of  (2.2).  Although  the  inequality  (2.2)  is  a  refinement  of 
Bernstein’s  inequality  (1.5),  the  equality  cases  of  these  two  inequalities  are  very  different  in 
nature.  For  Bernstein’s  inequality  (1.5),  the  assumption  that  |/'(to)|  =  n\\f\\^  fox  at  least  one 
point  to  implies  that  f{t)  is  a  sinusoidal  function:  f{t)  =  A  cos  nt-\-B  sin  nt.  But  for  any 
(arbitrary)  real  trigonometric  polynomial  f{t)  of  the  form  (2.1),  the  Schaake-van  der  Corput 
inequality  (2.2)  is  obviously  always  an  equality  at  every  to  where  |/(io)|  ==  ||/||oo-  Also 
(2.2)  is  an  equality  at  every  t  £R  whenever  f{t)  =  A  cos  nt~\-  B  sin  nt  or  f{t)  =  constant, 
and  the  converse  of  this  is  also  trivially  true.  However  one  can  easily  show  that  if  (2.2) 
is  an  equality  for  at  least  2n  distinct  points  of  the  semi-open  interval  [0, 27r[,  then  either 
f{t)  =  A  cos  nt-j-B  sin  nt  or  f(t)  =  constant.  In  this  last  remark  the  number  2n  is  optimal, 
(i.e.,  minimal):  One  can  find  non-sinusoidal  and  non-constant  real  trigonometric  polynomials 
of  the  form  (2.1)  for  which  (2.2)  is  an  equality  at  2n  -  1  distinct  points  of  the  semi-open 
interval  [0, 27r[. 

2.2.  Saffari' s  proof  of  the  Schaake-van  der  Corput  inequality 

We  will  obtain  (2.2)  as  a  corollary  to  Bernstein’s  inequality  (1.5)  together  with  its  equality 
case,  that  is,  ||/'||oo  =  ?^||/||oo  if  and  only  if  f(t)  is  a  sinusoidal  function  of  the  form 

(2.4)  /(t)  =  A  cos  nt-\-  B  sin  nt. 

Our  proof  is  by  contradiction.  Let  K  denote  the  vector  space  (on  the  field  R)  of  all  real 
trigonometric  polynomials  of  the  form  (2.1),  with  degree  ^  2n.  For  any  /  €  K\{0}  (i.e.. 
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/  ^  0),  define: 

(2.5) 

and  let 

(2.6) 


m 


To  prove  that  the  maximum  defined  by  (2,6)  indeed  exists,  first  note  that  K  is  a 
finite-dimensional  space  over  R  (since  dim^  =  2n  -F  1).  If  K  is  equipped  with  the  norm 
topology,  then  (2.5)  defines  a  continuous  mapping  H  from  K\{0}  into  E:  Indeed,  since 
dimjR  Vn  is  finite,  the  (linear)  differentiation  operator  /-)•/'  is  continuous  on  K,  and  so  the 
mapping  f  —>  is  a  continuous  function  from  14  into  V2n-i-  Also  all  the  norms 

(and  in  particular  the  sup-norm  function  (^  ->  ||(^||oo)  are  continuous  on  the  finite-dimensional 
vector  spaces  14  and  V2n-u  (dimR  Vsn-i  ==  4n  -  1),  so  the  mapping  H  defined  by  (2.5)  is 
continuous  as  claimed.  Also  H(Xf)  =  H(f)  for  all  A  G  E  with  A  ^  0,  hence  the  maximum 
defined  by  (2.6)  is  the  same  as  the  maximum  of  H{f)  when  /  is  restricted  to  the  (compact) 
unit  sphere  of  14  for  the  sup-norm  metric,  and  this  last  maximum  is  indeed  attained. 

Obviously  >  1  (since  for  every  /  G  K\{0}),  and  the  Schaake-van  der 

Corput  inequality  we  wish  to  prove  is  equivalent  to  the  equality  i/max  —  1.  To  prove  this  by 
contradiction,  we  suppose  henceforth  that 


(2.7)  -ffmax  >  1. 

Let  ip  G  14\{0}  be  some  trigonometric  polynomial  of  the  form  (2.1)  for  which 


(2.8) 


HW  = 


We  first  note  that  ip  is  not  a  constant  function,  since  otherwise  Tp'  =  0  and  so,  by  (2.5), 
H{ip)  =  1  contrary  to  our  assumption  (2.7).  Thus  -ip'  ^  0,  i.e.,  ip*  G  14\{0},  hence  we  can 
consider 


(2.9) 


nm  - 


Let  the  trigonometric  polynomial  (ip'{t)/  +  {ipit)/  reach  its  absolute  maximum  at 
^0  €  E: 


(2.10)  (V’'(io))^  +  (i’ik)/  = 


So  its  derivative  vanishes  at  to: 

(2.11)  2i/>'(to)tft"(to)  +  2n^ip(t(i)'ip'{to)  =  0. 


Bahman  Saffari  /Some  polynomial  extremal  problems  which  emerged  in. . . 


211 


By  (2.9)  we  cannot  have  V>'(to)  =  0,  otherwise  ||  +  n^V’^||oo  =  n?  by  (2.10),  and 

SO  we  would  have 

17/, /A  =  , 

nmi  ~  n^ilV-lli  ^ 

which  is  impossible  in  view  of  (2.7)  and  (2.8).  Thus  ^'(to)  #  0,  so  upon  dividing  both  sides 
of  (2.1 1)  by  Ip' {to)  we  get 

(2.12)  'ip"{to)  +  =  0. 

Note,  incidentally,  that  our  observation  ip' (to)  ^  0  is  another  way  of  seeing  that  ^  0  so 
as  to  justify  (2.9).  Finally,  since  for  sinusoidal  functions  f(t)  of  the  form  (2.4)  we  obviously 
have  H{f)  =  1,  it  follows  from  (2.7)  and  (2.8)  that  ip{t)  is  not  a  sinusoidal  function  of  the 
form  (2.4).  Therefore,  by  the  equality  case  of  Bernstein’s  inequality, 

(2.13)  <  nll^lloo. 

We  now  have  all  we  need  to  find  a  lower  bound  for  (2.9)  which  will  yield  the  desired  contra¬ 
diction  to  (2.7).  By  (2.9)  and  the  ^m’cHnequality  (2.13), 

^  ir^nV^lloo  ..  {r{to)?  +  nHf(to)/ 


=  - Jirnia -  [upon  dividing  by  n-*] 


>''  +  nVlloo 


[by  (2.10)] 


=  H. 


[by  (2.8)] 


Thus  H (ip')  >  iTmax,  which  contradicts  the  maximality  of  iT^ax  defined  by  (2.6).  There¬ 
fore  our  assumption  (2.7)  leads  to  a  contradiction,  and  thus  we  have  =  1  and  have 
proved  the  Schaake-van  der  Corput  inequality.  □ 

2.3.  Other  proofs  or  reformulations  of  the  Schaake-van  der  Corput  inequality 

I  gave  my  own  proof  of  the  Schaake-van  der  Corput  inequality  as  it  is  one  of  the  simplest 
ones  I  know,  and  certainly  simpler  than  the  original  one  [40].  Just  relying  on  my  memory  at 
the  moment  of  this  write-up,  I  think  I  know  a  good  dozen  different  proofs  of  this  inequality. 
There  might  be  many  more  in  the  literature.  I  hope  to  give  some  of  them  in  [39].  In  this 
connection,  here  is  an  unfortunate  anecdote:  Just  over  twenty  years  ago,  in  the  late  1970’s, 
I  found  my  above  proof  of  the  Schaake-van  der  Corput  inequality  and  gave  a  seminar  talk 
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about  it  at  Orsay.  The  following  week  the  late  S.  K.  Pichorides  (1940-1992)  talked  in  the 
same  seminar  on  a  nice  application  of  the  Schaake-van  der  Corput  inequality.  Shortly  after, 
without  Pichorides  or  myself  knowing  about  it,  two  physicists  happened  to  submit  a  joint 
mathematical  note  to  the  French  “Comptes  Rendus  de  V  Academic  des  Sciences”  giving 
a  beautiful  number-theoretic  proof  of  the  Schaake-van  der  Corput  inequality  (which  they 
believed  to  be  a  new  result),  and  they  had  applications  of  this  inequality  to  optics.  Their  note 
was  abruptly  rejected  by  the  referees  (two  mathematicians  at  Orsay,  none  of  whom  were  J.- 
P.  Kahane  or  Y.  Meyer),  with  the  comment:  *'This  result  is  not  new,  it  is  a  recent  theorem  of 
Saffari  and  Pichorides'\  So  much  for  the  competence  of  such  “referees”,  who  never  bothered 
to  ask  me  or  Pichorides.  Only  one  year  after,  when  I  heard  about  this  rejection,  could  I 
tell  those  “referees”  that  the  result  actually  went  back  to  1928  [42]  but  such  a  nice  new 
proof  should  not  have  been  rejected.  In  [39]  I  will  present  the  (unpublished)  proof  of  those 
physicists,  and  disclose  their  names. 

Polynomial  reformulation  of  the  Schaake-van  der  Corput  inequality  (2.2).  Let  the 

polynomial  P{x)  =  ^  self-inversive,  that  is,  a„_jt  a/t  for  all 

A;  =  0, 1, . . . ,  n.  Then  the  Bernstein  inequality  ||P'||oo  <  'f^\\P\\oo  can  be  improved  to: 

(2.14)  llP'lloo  ~l|P||oo  (yes,  equality!) 

Proof.  The  truth  of  (2.14)  when  n  is  odd  follows  from  the  truth  of  (2.14)  when  n  is  even 
by  considering  the  even-degree  polynomial  P{z^).  So,  without  loss,  we  may  suppose  n  even, 
n  =  2m.  The  (real-valued)  trigonometric  polynomial  f  (t)  :=  e“*”^*P(e'^)  is  then  of  the  form 
(2.1),  with  degree  m,  hence  upon  writing  P(e**)  =  e*""^/(i)  and  differentiating  both  sides, 
we  get 

ie''P'(e*')  -  {f'(t)  +  imf{t)) , 

hence 

(2.15)  \P’{e'‘)\^  =  (m?  +  mm{t)f  . 

Thus  (2. 14)  is  indeed  equivalent  to  (2.3).  □ 


The  above  polynomial  reformulation  (2.14),  like  the  original  formulation  (2.2),  was  re¬ 
discovered  by  many  people.  Some  of  these  proofs  do  shed  new  light  on  the  subject,  see  [39]. 

2.4.  “Self-improvement”  of  the  Schaake-van  der  Corput  inequality 

This  is  an  important  topic,  and  we  will  see  an  application  of  it  in  Section  4,  Let  the  real 
trigonometric  polynomial  f{t)  be  as  in  (2.1),  and  consider  the  trigonometric  polynomial  (of 
degree  <  2n  -  1): 

(2.16) 


Mt)  :=  + 
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By  (2.3)  we  have  ||/i||oo  =  '^^ll/IIL-  Applying  the  Schaake-van  der  Corput  inequality  (2.2) 
to  this  fi{t)  (with  n  replaced  here  by  2n  -  1),  we  obtain 

(2.17)  imf  +  (2n  -  1)"  (Mt)f  ^  (2n  -  l)^||/i||L 
which,  on  dividing  by  (2n  —  1)^  and  using  (2.16),  can  be  rewritten: 

(2.18)  +  nV(t))'  +  ({fit)/  +  {f{t)/y  <  n^f\t. 

This  is  a  refinement  of  (2.2).  We  can  continue  such  improvements  indefinitely,  but  they 
become  too  complicated  and,  mostly,  this  method  is  very  wasteful  as  it  ignores  the  fact  that 
fi{t)  ^  0  while  for  non-negative  trigonometric  polynomials  there  is  a  better  form  of  the 
Schaake-van  der  Corput  inequality  than  (2.2),  namely  inequality  (2.19)  below: 

Theorem  2.2  (The  case  of  non-negative  trigonometric  polynomials). 

If  a  trigonometric  polynomial  g(t)  of  the  form  (2.1)  satisfies  g{t)  ^0  for  a// 1  G  M,  then  for 
all  t  G  K 

(2.19)  ^  ■  g{t)- 

Proof.  Apply  (2.2)  to  the  polynomial  f{t)  :=  g(t)  -  |||(/l|oo.  □ 


Now,  applying  (2.19)  to  the  non-negative  trigonometric  polynomial  fi{t)  given  by  (2.16), 
we  obtain  for  any  (not  necessarily  non-negative)  real  trigonometric  polynomial  of  the  form 
(2.1): 


(2.20) 


(2n-iy  + 


2 

oo 


which  is  a  finer  improvement  of  the  Schaake-van  der  Corput  inequality  than  (2.18).  In  turn, 
there  is  yet  an  improvement  of  (2.20)  for  non-negative  trigonometric  polynomials.  Actu¬ 
ally  there  is  a  rather  nice  method  for  refining  (2.20)  indefinitely  while  obtaining  expressions 
which  are  not  too  complicated  and  also,  in  a  sense,  optimal.  We  will  drop  this  matter  here 
but  might  return  to  it  in  [39]  under  the  (weird-sounding  but  appropriate)  term  of  ''analytic 
bootstrapping''.  Let  us  also  note  that,  just  as  for  (2.14),  the  other  variations  and  refinements 
of  the  Schaake-van  der  Corput  inequality  can  be  reformulated  in  terms  of  algebraic  polyno¬ 
mials,  whether  self-inversive  or  not.  Thus,  for  P[X)  —  G  C[X]  not  necessarily 

self-inversive, 

(2.21)  ^  lP(e’‘)|)'  +  lP(e‘‘)|'  <  ll-P|Poc- 


Proof.  Divide  both  sides  of  (2. 19)  by  n^p(i)  and  take  g(t)  =  |P(e**) p.  □ 
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2.5.  Elementary  applications  of  the  Schaake-van  der  Corput  inequality 

If  f{t)  is  a  real  trigonometric  polynomial  of  the  form  (2.1),  then  one  can  refine  the  trivial 
inequality  ||/||2  ^  ||/||oo  into: 

(2.22)  n-W2  +  \\f\\U\\f\\lo^ 

This  is  obtained  by  integrating  (2.2)  on  [0, 27r].  □ 

Although  we  lost  information  in  the  integration  process,  and  (2.22)  is  therefore  weak  and 
wasteful,  it  is  still  useful  and  I  hope  to  give  some  application  of  (2.22)  in  [39].  An  even  more 
useful  inequality  is  the  following: 


Theorem  2.3.  Ifg(t)  is  a  non-negative  trigonometric  polynomial  of  the  form  (2.1),  then 
one  can  refine  the  trivial  inequality  ll^?lll*  Halloo  into: 

(2-23)  »^-^II9'II^  +  II<?IIN  Nil -1191100. 

Equivalently,  if  P{X)  =  €  C[X]  is  a  (not  necessarily  self-inversive) polynomial 

with  complex  coefficients,  then  the  trivial  inequality  ||F|||  ^  ll-PlIi’II^IlL  ^^n  be  refined 
into: 


(2.24) 


1 

dt 


+  11^114^  ^  NIIMI-PII 


2 

00‘ 


Proof.  To  obtain  (2.23)  integrate  (2.19)  on  [0, 27r].  Then  take  g{t)  —  |P(e**)f  to  obtain 
(2.24).  □ 


2.6.  Szego's  inequality 

One  year  after  the  appearance  of  their  1935  paper  [40]  in  which  (2.2)  was  proved,  Schaake 
and  van  der  Corput  pointed  out  in  a  short  note  [41]  that  their  result  (2.2)  was  already  contained 
in  (i.e.,  was  a  weaker  form  of)  a  more  precise  inequality  published  by  Szego  [42]  in  1928. 
The  statement  of  Szego’s  inequality  is  as  follows: 

Theorem  2.4.  Let  f{t)  be  a  real  trigonometric  polynomial  of  the  form  (2.1 ),  and  let  f{t) 
denote  its  conjugate  trigonometric  polynomial.  Then,  for  allt 

(2.25)  |n/(f)  -  f'(t)\  +  yj{m/+{my  <  nll/lloo  • 

In  this  paper  I  will  not  give  Szego’s  original  proof  of  (2.25)  but,  in  section  2.7  below,  I 
will  give  the  (possibly  new)  simple  proof  by  myself  and  the  late  S.  K.  Pichorides  [36],  via  an 
interpolation  method.  Yet  it  is  conceivable  that  the  Saffari-Pichorides  proof  of  (2.25)  (or  a 
similar  one)  is  already  in  the  literature. 
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A  few  remarks  before  quitting  this  section  2.6.  First,  in  (2.25)  f'{t)  is  the  conjugate 
function  of  the  derivative  f{t)  and  also  the  derivative  of  the  conjugate  function  /(f),  as  these 
are  equal.  So,  no  ambiguity  here. 

Another  remark  is  that  Szego’s  inequality  (2.25)  is  finer  than  the  Schaake-van  der  Corput 
inequality  (2.2)  simply  because,  for  all  f  E  E, 

(2.26)  +  <  \nf(t)  -  /'(<)|  +  y^(/'W)V  (/'(t))'. 

To  get  (2,26),  take  x  —  nf{t),y  =  f{t),  z  —  f(t)  in  the  inequality 

(2.27)  <  Irr  -  2;|  +  y/y^ 

which  holds  for  3X\x,y,z  eR  and  is  just  the  ordinary  triangle  inequality  for  Euclidean  norm. 

Equality  cases  of  Szego’s  inequality  (2.25):  One  can  see  in  several  ways  (e.g.,  from  our 
proof  in  the  next  section  2.7)  that  equality  holds  for  allt  eR  if  and  only  if  f{t)  is  of  the  form 

(2.28)  /(f)  =  Ao  +  Acosnf  +  Bsinnf, 

Le.,  for  a  wider  class  than  the  ones  for  the  Schaake-van  der  Corput  and  Bernstein  inequalities. 
Also,  for  a  given  /(f),  there  are  points  fo  E  M  (depending  on  /)  where  equality  holds  in 
(2.25).  This  discussion  is  interesting  but  we  drop  it  in  this  paper, 

A  final  remark:  By  throwing  out  nf{t)  -  f{t)  |  in  (2.25),  one  obtains: 

(2.29)  + 
or,  equivalently, 

(2.30)  |lP'|U^nP(P)|U 

for  all  polynomials  P{X)  =  ^  (2.29)  and  (2.30)  had,  of  course, 

been  noted  by  Szegd  (in  [42]  and  elsewhere).  They  are  also  in  Zygmund’s  classical  treatise 
[44],  together  with  their  1/  generalizations,  (p  >  1),  and  therefore  well  known  to  a  wide 
public  of  present-day  analysts.  However,  (2.2)  (which,  incidentally,  does  not  imply  (2.29) 
nor  is  implied  by  it  either)  and  (2.25)  are  not  in  Zygmund’s  treatise  [44]  and  thus  are  not  as 
widely  known  as  they  deserve.  In  [44],  Zygmund  does  attribute  (2.29)  to  Szego  and  refers 
to  Szego’s  1928  paper  [42],  yet  the  proof  of  (2.29)  presented  by  Zygmund  in  [44]  is  not 
Szegd ’s  original  proof  but  another  proof  akin  to  one  of  the  earliest  proofs  of  Bernstein’s 
inequality  (1.5)  by  Marcel  Riesz  [35]  (or  to  the  interpolation  method  of  section  2.7  below). 
The  last  time  I  saw  the  late  Professor  Zygmund,  in  1984, 1  asked  him  why  he  chose  to  include 
(2.29)  in  his  treatise  [44]  but  not  the  similar  (and  even  more  useful)  Schaake-van  der  Corput 
inequality  (2.2)  nor  the  strong  Szegd  inequality  (2.25)  either.  Zygmund  replied  that,  until 
my  conversation  with  him  about  this  matter,  he  had  never  heard  of  the  inequalities  (2.2) 
and  (2.25),  and  although  he  had  quoted  Szegd’s  paper  [42]  in  his  treatise  [44],  he  actually 
had  never  taken  a  look  at  Szegd’s  paper  [42]  (where  (2.25)  was  originally  proved).  He  had 
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also  not  read  some  other  papers  quoted  in  his  treatise  [44],  and  in  which  (2.2)  and  (2.25) 
were  mentioned  together  with  some  extensions.  At  least  Zygmund  was  perfectly  honest  and 
candid  in  this  respect.  He  was  a  good  friend  of  Szego,  and  the  two  did  great  joint  research 
(some  of  which  was  on  polynomials,  see  [43]). 


2.7.  The  Pichorides-Sajfari  proof  of  Szego's  inequality 

(Again,  I  note  that  our  proof  below  might  already  be  in  the  literature.) 
The  proof  is  based  on  the  identity 


(2.31) 


2/i7r  —  a\ 
n  ) 


gffc(2/i7r— a)/n 


k-\-ke 


for  all  a  6  E  and  all  integers  k,  n  such  that  0  ^  A:  <  n,  where  F„_i(a:)  denotes  the  Fejer 
kernel  of  degree  n-~\: 


(2.32) 


To  check  (2.31),  use  the  middle  sum  in  (2.32)  with  x  =  (2/i7r  —  a)ln,  plug  this  expression 
into  the  left  hand  side  of  (2.31)  and  change  the  order  of  summation,  then  note  that  the  new 
inner  sum  is  always  zero  except  for  r  =  —k  and  for  r  =  n  —  A:. 

Another  form  of  (2.31)  is 


(2.33)  gtM2/i7r  a)/n  ^  ^  |A:|cosq;  —  ik  sin  a 

for  all  a  €  E  and  all  integers  A:,  n  such  that  —n  ^  k  ^  n. 

[For  A:  ^  0,  (2.33)  is  the  same  as  (2.31);  for  A:  <  0  take  the  complex  conjugates  on  both 
sides  of  (2.31)]. 

On  taking  A:  =  0,  we  have  the  following  useful  identity: 

(2.34)  (for  all  a  G  E). 


Now,  for  any  integer  k  with  1  <  A;  ^  n,  multiply  both  sides  of  (2.31)  by  (with 

ipic  G  E)  and  then  take  the  real  parts,  to  get 


kt  (fk  h  • 


2hTT 


=  {n  —  k)  cos  {kt  +  ^jt)  +  fccos  (kt  +  <pk  —  oi). 


(2.35) 
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We  now  have  all  we  need  to  prove  the  interpolation  formula: 

~  f(i)  +  f{i)  cos  a  -  f{t)  sin  a  = 

^  (2h'K  —  a\  2hTT  —  a\ 

(2.36) 

for  any  real  trigonometric  polynomial  f{t)  of  the  form  (2.1).  Indeed,  write  f{t)  in  the  more 
convenient  form 

(2.37)  /(<)  =  Ao  +  ^  r*  cos  {kt  +  ipk),  (r^  =  sjAl  +  Bf) 

fc=l 

and  note  that,  by  linearity  of  the  differentiation  f  f  and  of  the  Hilbert  transform  /  — ^  it 
suffices  to  check  (2.36)  for  f{t)  =  1  and  for  f{t)  =  cos  {kt  (pk),  {1  ^  k  ^  n).  For  f{t)  = 
1,  (2.36)  reduces  to  (2.34).  For  f{t)  =  cos  {kt  +  pk),  we  have  f'{t)  =  -k  sin  {kt  +  cpk)  and 
f{t)  =  k  cos  {kt  +  (pk)y  so  that  (2.36)  reduces  to  (2.35).  Thus  (2.36)  is  proved. 

The  desired  inequality  (2.25)  follows  from  (2.36).  Indeed  by  (2.32)  (which  implies 
Fn-i{x)  ^  0)  and  (2.34),  for  any  fixed  a  and  i  e  R,  the  modulus  of  the  right-hand  side 
of  (2.36)  is  majorized  by 

11/11^  =  „||/||^. 

A=1  \  "  / 

Now,  for  any  fixed  f  G  R,  the  maximum  modulus  of  the  left-hand  side  of  (2.36)  (as  a 
varies)  is 

in/(i)  -  /'(t)i + 

and  the  proof  of  (2.25)  is  complete.  □ 

2.8.  Some  sources 

The  very  important  subject  of  Markov-Bemstein  type  inequalities  deserves  a  reasonably 
complete  monograph.  Pending  the  writing  of  such  a  book,  which  is  no  easy  task,  here  are  just 
a  few  good  sources  of  information  {among  many  others): 

1.  The  1983  book  “Les  inegalitds  de  Markoff  et  de  Bernstein”  by  Q.  I.  Rahman  and 
G.  Schmeisser  [31]  and  its  good  list  of  references.  This  (rather  short)  mimeographed 
book  is  written  in  a  delightfully  reader-friendly  style  reminiscent  of  those  of  S.  N. 
Bernstein  [4]  and  C.  de  La  Vallee  Poussin  [12].  A  modest  knowledge  of  French 
should  be  enough  for  reading  this  book. 

2.  P.  Borwein  and  T.  Erdelyi  (either  separately,  or  together,  or  with  other  co-authors)  have 
many  publications  on  this  subject  (and,  by  the  way,  on  several  other  topics  pertaining  to 
the  themes  of  this  paper  and  to  those  of  [39]).  Most  preprints  of  their  publications  can 
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be  downloaded  from  the  authors’  well  organized  home  pages  at  the  Web  sites  of  Simon 
Fraser  University  [9]  and  Texas  A&M  University  [14],  respectively.  Although  these 
reprints  are  not  always  the  final  forms  of  the  publications  and  do  sometimes  contain 
typos,  the  papers  are  of  a  high  level  and  also  good  sources  of  references  (modulo  a  few 
errors). 

3.  Many  Ph.  D  theses  and  other  publications  on  this  subject  produced  in  the  province  of 
Quebec,  Canada,  where  there  is  a  tradition  of  this  type  of  studies.  I  have  lost  track  of 
what  is  being  done  there,  but  the  reader  could  look  at  the  university  Web  sites  and  at 
the  publication  lists  of  “Presses  de  I’Universite  de  Montreal.”.  One  example  (among 
others)  is  the  1983  Ph.  D.  thesis  of  C.  Frappier  [17]. 

3.  Some  types  of  flat  polynomials 

If  a  polynomial  P(X)  =  E  C[Ar]  has  at  least  two  non-zero  coefficients, 

then  it  cannot  be  “perfectly  flat”  (i.e.,  have  constant  modulus)  on  the  whole  unit  circle:  The 
relation 

(3.1)  =  constant  (for  all  t  eR) 

is  impossible.  To  see  this,  call  OrX^  (resp.  OgX^)  the  non-zero  term  of  P(Ar)  of  lowest 
(resp.  highest)  degree,  (0  ^  r  <  5  ^  n),  and  note  that  |P(e**)p  is  a  non-zero  trigonometric 
polynomial  with  leading  term  2|ara5|  cos  {mt  +  ^  —  a)  where  m  =  s  -  r  >  0, 

Us  =  |o,|e’^. 

However,  for  a  variety  of  classes  of  non-monomial  polynomials  P{X),  the  ideal  (and 
impossible!)  “perfect  flatness”  situation  (3.1)  can  still  be  approximated  in  various  ways.  The 
diverse  notions  of  “flatness”  of  a  polynomial  (or,  more  frequently,  of  a  sequence  or  a  class  of 
polynomials)  often  refer  to  the  various  ways  of  approximating  the  ideal  situation  (3.1).  For 
example,  given  a  subset  F  (usually  a  subgroup)  of  the  unit  circle,  if 

(3.2)  \P(9)\  =  constant  (for  all  ^  G  F) 

then  we  say  that  the  polynomial  P{X)  is  perfectly  flat  on  the  set  F.  There  are  some  non-trivial 
open  problems  on  that  notion  of  perfect  flatness,  and  we  will  briefly  see  a  glimpse  of  them 
in  section  3.1  below.  Another  example  of  a  flatness  requirement  is  to  look  for  polynomials 
P[x)  =  J2k=o  ^  which  the  moduli  [aol,  |ai|, . . . ,  |a„|  are  given  non-negative 

numbers  and  for  which  the  sup-norm  ||P||oo  •—  is  either  as  small  as  possible 

{extremal  problem)  or  satisfies  some  smallness  requirement.  Thus,  if  we  impose  |a)t|  =  1 
for  all  A:  ==  0, 1, . . . ,  n  (or  even  without  this  restriction),  the  previous  problem  is  equivalent 
to  some  smallness  requirement  on  the  ''crest  factor''  (or  "peak  factor")  ||P||oo/||P||2  where 
II P II 2  denotes  the  P^-norm: 

/  1  ^27r  \  1/2  /  n 


(3.3) 
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Actually,  many  flatness  problems  for  polynomials  can  be  expressed  in  terms  of  compari¬ 
son  of  two  norms  in  the  vector  space  of  polynomials  of  degree  ^  n.  We  shall  take  brief  looks 
at  some  examples  in  the  sequel. 

3.1.  Perfect  flatness 

As  we  said  above,  a  polynomial  P{X)  -  ^  '"perfectly  flaP  on  a 

subset  r  of  the  unit  circle  if  (3.2)  holds.  It  is  easy  to  see  that  if  P  is  a  (finite  or  infinite)  sub¬ 
group  of  the  unit  circle  with  card  P  >  deg  P  (where  card  P  denotes  the  number  of  elements 
of  P),  then  the  constant  in  (3.2)  equals  ||P||2  defined  by  (3.3),  but  this  is  not  always  true  for  a 
subgroup  P  such  that  card  P  <  deg  P. 

So  the  perfect  flatness  (3.2)  cannot  hold  if  P  is  the  whole  unit  circle.  Assuming  P(0)  0 

and  deg  P  =  n,  it  is  easy  to  see  that  (3.2)  cannot  hold  either  if  card  P  >  2n,  but  that  whenever 
card  P  <  2n  there  are  some  such  P{X)  for  which  (3.2)  holds. 

An  interesting  open  problem  is  the  following:  Given  an  integer  n  ^  1,  find  the  largest 
value  $(n)  of  those  integers  d  ^  1  such  that  there  exists  a  polynomial  P(X)  =  Ylk=o^hX^  G 
€[X]  which  is  "unimodulaf"  (i.e.,  |afc|  =  1  for  all  A:  —  0, 1, . . . ,  n)  and  for  which  the  perfect 
flatness  (3.2)  holds  on  the  group  Pd  of  d-ih  roots  of  unity  in  C  The  computation  of  $(n) 
is  easy  for  small  n,  for  example:  <I»(1)  =  2,  <^(2)  =  4,  ^(3)  —  5,  $(4)  =  $(5)  =  6.  A 
list  of  values  of  $(n)  has  been  computed  (Bjorck  &  Saffari,  1998,  and  on-going  search  in 
2000-2001).  No  values  of  n  with  ^(n)  ^  n  +  3  have  been  found  so  far,  but  we  do  not  have 
enough  evidence  to  conjecture  that  one  always  has  either  $(n)  =  n  +  1  or  $(n)  —  n-\- 2. 
The  following  partial  results  are  known: 


(3.4) 

$(n)  ^  n  -t- 1 

(for  all  n  ^  1) 

(3.5) 

^(n)  ^  n-\-2 

(for  infinitely  many  n) 

(3.6) 

$(n)  ^  2n  —  1 

(for  all  n  >  1) . 

(3.4)  follows  from  elementary  results  (Gauss  sequences),  see  section  3.2  on  bi-unimodular 
sequences.  (3.5)  is  a  non-trivial  result  of  Bjorck  and  Saffari  (unpublished).  (3.6)  is  not  hard 
to  prove  for  even  n,  however  for  odd  n  it  can  be  shown  (Saffari,  unpublished)  to  be  equivalent 
to  a  profound  theorem  of  Dresel,  White  and  Hunt  (see  also  the  section  in  [39]  on  “Huffman 
sequences”). 

If,  instead  of  polynomials  with  complex  unimodular  coefficients,  we  consider  polynomials 
with  real  unimodular  coefficients  {i.e.,ak  -  ±1  for  all  A:  =  0, 1, ... ,  n),  then  (3.2)  becomes 
a  different  type  of  problem.  It  is  very  easy  to  check  that,  in  that  case,  (3.2)  never  holds  if 
card  P  ^  n  -I-  3.  It  can  be  proved  (Saffari  [38],  unpublished)  that  (3.2)  never  holds  either 
if  card  P  =  n  +  2.  As  for  the  impossibility  of  (3.2)  for  card  P  =  n  +  1  (except  for  n  =  3), 
it  is  equivalent  to  the  famous  “Hadamard  Circulant  Conjecture”  due  to  Ryser  [33],  an  open 
problem  going  back  to  the  1950’s  and  stating  that  a  circulant  matrix  of  order  L  (with  ±1 
entries)  cannot  be  a  Hadamard  matrix  unless  L  —  4. 
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3.2.  Bi-unimodular  sequences 

This  is  a  notion  which  we  will  define  a  little  later  and  which  will  turn  out  to  be  equivalent 
to  that  of  unimodular  polynomials  of  degree  n  which  are  perfectly  flat  on  the  group 
of  (n  +  l)th  roots  of  unity  in  C  (see  section  3.1  above).  I  mention  this  notion  at  this  point 
because,  historically,  it  can  be  considered  as  the  oldest  idea  of  “flat  polynomials”  since  it  goes 
back  to  Gauss.  Indeed,  let  L  be  any  odd  integer  ^  3,  and  consider  the  '"Gaussian  sequence'' 
of  length  L: 

(3.7)  0/=:=^^*=',  (A:  =  0,1,...,L-1) 

where  lj  =  exp  {2i7T/L)  is  the  first  primitive  L-th  root  of  1  in  C,  and  A  is  any  integer  relatively 
prime  to  L.  Equivalently,  we  may  consider  the  infinite  sequence  (a^),  {k  eZ),  as  periodic  of 
period  L.  Defining  the  normalized  discrete  Fourier  transform  (DFT)  of  as  the  sequence 

1 

(3.8)  fir  := •w’’*,  (r  =  0,l,...,L-l) 

fc=0 

we  can  easily  check  that  (a^)  is  unimodular  as  well:  ja^l  =  1  for  all  r  =  0, 1, . . .,  L  -  1.  This 
fact  was  known  to  Gauss  (to  whom,  by  the  way,  the  earliest  ideas  of  Fourier  Analysis  can 
be  tracked  down,  and  not  to  Fourier  or  Clairaut).  More  generally,  given  any  integer  L  ^  2, 
whether  odd  or  even,  any  finite  sequence  (a*),  (fc  =  0, 1, . . . ,  L  —  1),  of  L  complex  numbers 
will  be  called  "bi-unimodular"  if  it  has  modulus  one  (|aA:|  =  1  for  all  A:  =  0, 1, . . . ,  7/  —  1) 
and  if  its  normalized  DFT 

L-l 

V A:=0 

has  modulus  one,  too:  |dr|  =  1  for  all  r  =  0, 1, . . . ,  L  ~  1.  (The  term  “bi-unimodular”  was 
coined  by  G.  Bjorck  and  myself  in  our  1995  joint  paper  [7]).  Thus,  for  odd  L,  the  Gaussian 
sequence  (3.7)  is  an  example  of  a  bi-unimodular  sequence  of  length  L.  The  sequence  (3.7)  is 
not  bi-unimodular  when  L  is  even^  but  in  this  case  we  have  another  type  of  Gaussian  sequence 
which  is  bi-unimodular: 

(3.9)  (fc  =  0,l,...,L-l) 

where  ^  =  exp  (^Tr/L)  is  the  first  primitive  root  of  unity  of  order  2L  in  C,  and  A  again  any 
integer  relatively  prime  to  L.  Note  that,  while  (3.7)  is  bi-unimodular  for  odd  L  but  not  for 
even  L,  similarly  (3.9)  is  bi-unimodular  for  even  L  but  not  for  odd  L. 

Obviously  a  unimodular  sequence  (a^),  (/c  =  0, 1, . . . ,  L  —  1),  of  length  L  is  bi-unimod- 
ular  if  and  only  if  its  “associated  polynomial”  P{X)  =  (of  degree  n  ~  L  -  1)  is 

perfectly  flat  on  the  group  of  Z/-th  roots  of  1  in  C. 

The  starting  point  of  the  theory  of  bi-unimodular  sequences  was  an  oral  question  asked 
by  Per  Enflo  in  1983  at  Stockholm  University  [13]:  Ifp  is  a  given  odd  prime  number,  is  it  true 
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that  the  Gaussian  sequences  \  ^  integers  with  p  not  dividing 

Aj,  (/s  =  0, 1, . . .  ,p  —  Ij,  are  the  only  unimodular  sequences  of  length  p,  with  Oq  =  1,  whose 
normalized  DVT  has  modulus  one? 

In  our  present  vocabulary,  Per  Enflo  was  asking  whether  such  Gaussian  sequences  were 
the  only  normalized  (ao  =  1)  bi-unimodular  sequences  of  odd  prime  length.  If  the  answer 
had  been  “yes”,  it  would  have  helped  him  with  estimations  of  some  exponential  sums.  Later 
on  he  found  another  method  (not  requiring  an  answer  to  the  above  question)  to  carry  out  the 
estimations  of  his  exponential  sums  anyway. 

Forp  =  3  the  answer  is  trivially  “yes”,  and  forp  =  5  Lovasz  [24]  checked  that  the  answer 
is  “yes”  as  well. 

In  1984  G.  Bjorck  (Stockholm  University)  was  trying,  by  computer  search,  to  check  that 
for  p  =  7  the  answer  to  Per  Enflo’s  question  was  “yes”  as  well,  when  suddenly  the  counter¬ 
example 

(3.10)  (1, 1, 1,  1,  e*^,  e^^)  (with  0  =  arccos  (—3/4)) 


“popped  out”  (as  Bjorck  put  it!).  Later  on  in  1984  Bjorck  found,  again  by  computer  search, 
other  counter-examples  to  Per  Enflo’s  question,  including 

(3.11)  (l,l,e'^l,l,l,e^e'",e^l,e*") 


(with  a  =  arccos  (—5/6)).  When  early  in  1985  Bjorck  presented  these  counter-examples  at 
the  A.  Haar  memorial  Conference  [5],  he  still  had  no  idea  of  the  structure  of  the  sequences 
(3.10)  and  (3.11).  Actually,  if  in  the  sequence  (3.10)  [resp.  (3.11)]  we  replace  the  first  term 
by  zero  and  the  terms  [resp.  e*'^]  by  -1,  we  get  the  ''Legendre  symboL  sequence  modulo 
7  [resp.  mod  11]:  The  terms  [resp.  e**^]  are  located  at  the  quadratic  non-residues  (modulo 
7,  resp.  modulo  11).  This  is  a  fact  that  Bjorck  observed  a  little  later,  in  the  fall  of  1985,  and 
that  he  subsequently  generalized  ( [6],  1990)  to  every  prime  =  -1  (mod4): 

If  in  the  p-term  "Legendre  symboV*  sequence  (0, 1, . . . ,  —  1),  (p  any  prime  =  —  1  modulo 
4)  we  replace  the  first  term  zero  by  1  and  every  —1  by 


(3.12) 


exp 


arccos 


^-P\ 

I+W 


1  4-p  1  +p 


we  obtain  a  bi-unimodular  sequence  of  length  p,  with  only  two  values,  namely  1  and  the 
number  given  by  (3.12). 

In  sections  3.5  and  3.6  below  I  shall  determine  all  bi-unimodular  sequences  with  only 
two  values:  it  will  turn  out,  in  particular,  that  Bjorck’s  above  theorem  is  not  specific  to  prime 
numbers  but  can  be  extended  to  all  integers  v  (necessarily  =  —  1  modulo  4)  for  which  there 
exists  a  so-called  “Hadamard-Paley  cyclic  difference  set.”  Thus  it  will  be  seen  to  work  for 
u  =  15,  which  is  the  smallest  non-prime  v  with  this  property.  From  the  same  discussion  of 
sections  3.5  and  3.6  it  will  follow  that  such  a  bi-unimodular  sequence  (with  only  two  values) 
cannot  exist  for  any  length  =  1  (mod4).  However,  for  prime  lengths  —  1  (mod4),  Bjorck 
proved  (in  the  same  paper  [6])  the  next  best  thing: 
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If  in  the  p-term  Legendre  symbol*'  sequence  (0, 1, . . . ,  —  1, . . . ,  1),  (p  any  prime  =  1 
modulo  4)  we  replace  the  first  term  0  by  i,  every  term  1  by 


(3.13) 


T}  :=  exp  I  ^  arccos  - - 

V  P- 1 


tfy/p-l  ^  -ip  +  2S./p 

P  —  I  P  —  1 


(with  any  choice  ofS  =  dzl)  and  every  by  the  complex  conjugate  rj,  with  the  same  choice 
of  6  =  ±1,  then  we  obtain  a  bi-unimodular  sequence  of  length  p  (which  thus  has  first  term  1 
and  only  two  other  values,  namely  q  and  fj). 

In  the  1990’s  more  research  was  done  on  bi-unimodular  sequences  (Saffari  [37],  Bjorck 
&  Saffari  [7],  Haagerup  [19], ... )  and  there  are  some  very  nice  results  and  open  problems  on 
this  subject,  that  I  hope  to  discuss  in  [39]. 


3.3.  Digression  on  (?;,  k,  A)  difference  sets 

This  brief  digression  into  Combinatorics  (in  this  section  and  in  next  section  3.4)  is  just 
intended  to  recall,  for  the  benefit  of  the  harmonic  analyst  reader,  some  definitions  and  ele¬ 
mentary  facts  which  will  be  useful  in  the  discussion  (in  sections  3.5  and  3.6)  of  polynomials 
whose  coefficients  only  take  two  values  (whether  unimodular  or  not)  and  which  are  perfectly 
fiat  on  the  group  Fx,  of  L-ih  roots  of  1  in  C,  (L  =  1  +  deg  P). 

There  are  currently  two  entirely  different  and  unrelated  types  of  mathematical  objects, 
both  carrying  (unfortunately)  the  same  name  of  “difference  sets”.  We  will  be  concerned  only 
with  the  second  notion,  yet  I  will  define  both  just  to  avoid  any  possible  confusion  (as  I  have 
often  witnessed  instances  of  such  confusion  between  the  two  notions). 

The  set  A(5)  :=  {x  ~  y  :  x^y  £  5},  where  S  is  any  subset  of  an  (additive)  abelian 
group  G,  is  often  called  the  “difference  set”  of  S.  This  notion  is  well  known  to  all  analysts 
because  of  the  result  saying  that  if  G  —  R"  and  if  S  has  positive  Lebesgue  measure,  then  the 
origin  belongs  to  the  interior  of  A  (5)  and  therefore  A  (S')  has  non-empty  interior.  There  is 
an  enormous  literature  on  this  notion  and  its  extensions,  in  such  areas  as  analysis,  algebra, 
combinatorics  and  number  theory.  I  will  not  say  anything  else  on  this,  as  it  is  of  no  concern 
to  us  here. 

The  second  notion  of  “difference  sets”  (the  one  which  does  interest  us  here)  pertains  to 
finite  groups  only.  Let  G  be  any  finite  group,  abelian  or  not,  with  neutral  element  denoted 
by  e.  A  subset  D  of  G  is  called  a  left  (resp.  right)  (i;,  k,  A)  difference  set  if  card  G  =  v, 
card  D  =  k  and  the  intersection  (uD)  n  D  (resp.  (Du)  fi  D)  has  cardinality  A  whenever 
u  e  GyU  ^  e.  The  term  “difference  set”  is  due  to  the  fact  that  such  sets  were  first  considered 
in  additive  groups  Z/nZ.  Note  that  £)  C  G  is  a  left  (u,  k.  A)  difference  set  if  and  only  if 
D~^  :=  :  X  6  77}  is  a  right  (v,  A:,  A)  difference  set.  Also  if  77  is  a  left  (resp.  right) 

(t;,  k,  A)  difference  set,  then  its  complement  D'  :=  G\77  is  also  a  left  (resp.  right)  (v\  k',  A') 
difference  set,  with 


(3.14) 


v’  =  V,  k'  =  V  -  ky  \'  =  V  —  2k  -I-  X. 
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The  most  obvious  examples  of  difference  sets  are  the  following  four  types  (called  the  ^'trivial 
difference  sets''): 

1.  The  empty  set  0,  which  is  a  (u,  0, 0)  difference  set. 

2.  Singletons,  which  are  (u,  1, 0)  difference  sets. 

3.  D  —  Gy  which  is  a  (u,  u,  u)  difference  set. 

4.  Complements  of  singletons,  which  are  (u,  —  1,  u  -  2)  difference  sets  (if  v  ^  2). 

The  simplest  (and,  historically,  the  first)  example  of  a  non-trivial  difference  set  is  due  to 

Paley  [29]:  If  p  is  a  prime  =  —  1  (modA),  then  the  set  of  all  (non-zero)  quadratic  residues 
modulo  p,  and  also  the  set  of  all  quadratic  non-residues  modulo  p,  are  {v,kyX)  difference 
sets  with  V  =  p,  k  =  (p  -  l)/2,  A  =  (p  -  3)/4.  (These  sets  are  non-trivial  difference  sets  if 
p  ^  7,  and  trivial  (3, 1, 0)  difference  sets  if  p  =  3). 

In  the  general  case,  a  simple  counting  argument  shows  that 

(3.15)  k{k  -l)  =  {v-  1)A. 

Let  F  :  G  — >  C  be  any  complex-valued  function  defined  on  G.  Its  right  (resp.  left)  autocor¬ 
relation  function  is  the  function  7^  :  G  ^  C  (resp.  /y :  G  — >■  C)  defined  by 

(3.16)  7^(«)  :=  fl{n)  :=  . 

geG  geG 

If  is  the  characteristic  function  of  any  (a  priori  arbitrary)  subset  D  of  G,  Le.y  x(5)  —  1 

if  g  e  D  and  x(p)  =  0  if  5  ^  -G,  then  D  is  obviously  a  right  (resp.  left)  (u,  A:,  A)  difference 
set  if  and  only  if 

(3.17)  TxM  —  ^  if  u  =  e,  7^(u)  =  A  ifu^e 

(3.18)  resp.  ^7(u)  —  k  if  u  =  e,  ^(u)  —  X  ifu^e. 

This  yields  another  proof  of  (3,15).  Indeed,  for  example  by  (3.17), 

/  g^G  h&G  g&G  u^G 

=  Xt'xW  =  +  X ^ 

u€G  u^e 

(This  was,  of  course,  just  the  classical  convolution  product  argument). 

3.4.  Binary  functions  on  finite  groups  with  autocorrelation  functions  constant  outside  the 
neutral  element 

By  a  ‘‘'binary  function"  we  mean  complex- valued  functions  only  taking  two  values,  which 
can  be  arbitrary  complex  numbers,  possibly  of  modulus  >  1.  If  o;  €  C  and  e  C  are 
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these  values,  we  call  such  a  function  an  {a,  ^}-function.  Examples  of  binary  functions  are 
{0,  l}-functions,  {1,  —  l}-functions  (also  called  ±1  functions),  etc. 

Although  the  following  result  might  conceivably  not  have  appeared  elsewhere  in  the  ex¬ 
plicit  form  given  below,  it  is  nevertheless  probably  well  known  (in  essence)  to  many  people 
in  combinatorics  and  signal  processing,  at  least  in  the  case  of  abelian  or  cyclic  groups. 

Theorem  3.1.  Let  G  be  a  (not  necessarily  abelian)  finite  group  of  order  v  with  neutral 
element  e,  and  let  F  :  G  C  be  any  binary  function  with  values  {a,  fi},  (a  ^  p).  Then  the 
right  (resp.  left)  autocorrelation  function  ofF  is  constant  on  G'\{e}  if  and  only  ifF~^{a)  := 
{x  ^  G  :  F{x)  —  Q;}  is  a  right  (resp.  left)  (v,k,X)  difference  set.  In  that  case  the  right 
(resp.  left)  autocorrelation  function  has  value  70  >  0  a;  =  e,  and  the  same  (real)  value 
')  at  all  X  6  G\{e},  where 

(3.19)  7o  =  fc|Q;p  +  (w-fc)|/9p 
and  7  is  defined  by  any  of  the  three  (equivalent)  relations 

(3.20)  7  =  7o-(fe- A)|a-^P 

(3.21)  7  =  A|a|^  +  (u-2A;  +  A)|/3p 

(3.22)  \ka  +  {v-  =  70  +  (i;  -  1)7, 

SO  that  7  satisfies  the  inequalities 

(3.23)  -- ^<7<7o. 

The  proof  is  based  on  the  following  lemma,  which  is  also  useful  elsewhere. 

Lemma  3.1.  Let  H  :  G  C  be  any  complex-valued  function  on  a  (not  necessarily 
abelian)  group  of  order  v.  Put  J{x)  =  aH{x)  -H  b  with  a,b  E  C.  Then  the  right  autocorrela¬ 
tion  functions  7^  (u)  and  7^  (w)  satisfy 

(3.24)  7j(«)  =  |o|27„(«)  +  |f)p  •  V  +  23? J , 

\  / 

and  a  similar  identity  holds  for  the  left  autocorrelation  functions. 

Proof.  The  proof  of  the  lemma  is  straightforward.  To  prove  the  above  theorem,  we  may  just 
consider  the  right  autocorrelation  function,  as  the  change  of  variable  x  x~^  reduces  the 
left  case  to  the  right  one.  Let  x  •=  Xd  where  D  ~  F-\a).  Then  the  theorem  follows  from 
the  above  lemma  by  straightforward  calculations,  upon  noting  that  each  of  x(^)  and  F{x) 
can  be  expressed  in  terms  of  the  other  one  from  the  identity  F{x)  =  [a  -  f)x{^)  1^,  since 

a  —  /5  0.  □ 
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3.5.  Binary  functions  on  finite  groups  with  autocorrelation  functions  vanishing  outside  the 
neutral  element 

Let  G  be  any  finite  (not  necessarily  abelian)  group  of  order  v  ^  2,  with  neutral  element  e. 
Our  purpose  in  this  section  is  to  find  all  binary  functions  (and,  in  particular,  all  unimodular 
binary  functions)  F  :  G  -)•  C  for  which  the,  say,  right  autocorrelation  function  7^  satisfies 
7^(u)  =  0  for  all  u  €  (j\{e}.  Without  loss  we  may  assume  that  the  two  values  of  F  are 
1  and  some  ^  E  C,  (/?  1).  As  a  special  case  of  the  result  of  section  3.4,  with  7  =  0,  G 

contains  a  (right)  (u,  fc,  A)  difference  set  D  such. that  F{x)  =  1  if  x  e  D  and  F{x)  —  if 

X  E  G\D.  Since  the  case  |^|  =  1  (of  binary  unimodular  functions)  is  an  important  special 

case,  our  purpose  consists  of  addressing  the  following  two  problems: 

A)  If  D  C  G,  (D  ii),  is  some  (right)  (v,k,X)  difference  set,  find  all  those  ^  E  C  such 
that  the  binary  function  F  :  G  ^  C  defined  by  F{x)  =  1  if  x  ^  D  and  F{x)  =  p  if 
X  E  G\D  satisfies  7p(u)  =  0  for  all  u  E  <j\{e}. 

B)  Find  all  those  (right)  (v,  k,  A)  difference  sets  D  C  G  for  which  there  exists  at  least  one 
complex  number  fi,  with  \fi\  =  I,  so  that  the  above  (unimodular)  function  F  satisfies 
7^  (w)  =  0  for  all  u  ^  G\{e}. 

Solving  Problem  A):  Put  o;  =  1  and  7  =  0  in  (3.21),  to  get 

(3.25)  A  +  (t;  -  2A;  +  +  2{k  -  =  0. 

Recall  that  G\D  is  a  (right)  (t»',  A')  difference  set  with  v'  =  v,k’  =  v-k,  X'  =  v~2kFX. 

So  if  the  coefficient  A'  =  v  —  2/c  +  A  of  in  (3.25)  is  zero,  then  D  is  a  “trivial”  difference 
set  (see  section  3.3)  for  which  either  k  =  X  —  v  or  k  —  v  —  1,  X  =  v  -  2.  In  the  former  case 
the  left  side  of  (3.25)  reduces  to  A  =  w,  which  is  impossible  since  i;  7^  0.  In  the  latter  case 

(3.25)  reduces  to 

(3.26)  =  (witht;>2) 

and  those  /3  E  C  which  satisfy  (3.26)  constitute  a  vertical  line.  The  case  k  =  X  =  0  is 
excluded  since  we  supposed  ^  0.  The  last  possibility  for  D  to  be  a  “trivial”  difference  set 
(Le.,  now  a  singleton)  isk  =  1  and  A  =  0,  in  which  case  (3.25)  reduces  to 

(3.27)  (i;- 2)1^1^  +  297,5  =  0. 

If  V  =  2,  then  (3.27)  is  the  same  as  (3.26)  and  the  acceptable  E  C  are  all  the  =  it, 
(t  E  R).  If  z;  ^  3,  then  the  set  of  those  e  C  which  satisfy  (3.27)  is  the  circle  of  radius 
l/(z;  -  2)  with  center  of  abscissa  -!/(?;-  2)  on  the  real  axis. 

Now  if  £>  is  a  non-trivial  difference  set  (which  easily  implies  v  ^  7),  then  in  (3.25)  we 
have  A  ^  1,  X!  ^  1  and  k  —  X'^  1.  The  set  of  these  ^  E  C  satisfying  (3.25)  is  then  the  circle 
of  radius  y/k  —  A/A'  with  center  of  abscissa  —(A;  —  A) /A'  on  the  real  axis,  (A'  =  z;  —  2A:  +  A). 
Let  us  recapitulate  the  result  obtained  regarding  problem  A): 


226 


Bahman  Sqffari  /Some polynomial  extremal  problems  which  emerged  in, . , 


Theorem  3.2.  Let  G  be  a  (not  necessarily  abelian)  finite  group  of  order  v  %  with 
neutral  element  e.  Let  D  C  G,  (D  ^  %),  be  a  (right)  {v,  k,  A)  difference  set.  Define  a  binary 
function  F  :  G  C  by  F{x)  =  I  if  x  e  D  and  F{x)  —  f  if  x  e  G'Xje},  where  ^  e  Cis 
arbitrary.  Then  the  right  autocorrelation  function  7^  identically  vanishes  on  D\{e}  if  and 
only  if  (5  is  in  the  set  5^  of  those  /?  €  C  such  that 

(3.28)  A  +  A'|/3p  +  2(k  -  A)K^  =  0 

with  X'  =  V  -  2k-{-  X.  Also  5^  is  non-empty  if  and  only  if: 

a)  Either  D  is  the  complement  of  a  singleton,  in  which  case  is  the  vertical  line  = 

1  -  - 
1  2* 

b)  Orv  ^  3  and  D  is  a  singleton,  in  which  case  is  the  circle  of  radius  l/{v  -2)  with 

center  of  abscissa  —l/{v  —  2)  on  the  real  axis. 

c)  Or  D  is  a  non-trivial  difference  set,  in  which  case  is  the  circle  of  radius  y/k  —  A/A' 

with  center  of  abscissa  ~{k  -  X)/X'  on  the  real  axis. 

Before  proceeding  to  address  Problem  B,  let  us  make  an  interesting  remark:  Putting  D'  := 
G\D,  then  in  the  above  theorem,  whether  D  is  a  trivial  or  a  non-trivial  difference  set,  the  set 
S^,  C  C  is  always  the  transform  of  5^  by  the  inversion  whose  inversion-circle  is  the  unit 
circle. 

Solving  Problem  B).  This  boils  down  to  deciding  whether  the  set  of  the  above  theorem 
intersects  the  unit  circle.  So  let  us  examine  the  various  possibilities.  If  D  is  the  complement 
of  a  singleton  (with  v  >  2),  the  vertical  line  9R/3  =  1  -  |  intersects  the  unit  circle  if  and 
only  if  1  -  I  ^  “1,  that  is,  either  z;  -  2  or  =  3  or  i;  =  4.  For  i;  =  2  the  intersection 
points  are  —  ±i,  yielding  the  Gauss  sequences  (l,±i)  of  length  2  (see  section  3.2).  For 
v  =  3  the  intersection  points  are  /?  =  exp  (±2i7r/3),  yielding  the  Gauss  sequences  (1, 1,  j) 
and  (1,  0  and  their  obvious  modifications.  For  ~  4  the  intersection  point 

is  at  ^  =  -1,  yielding  the  ±1  functions  whose  values  a,  6,  c,  d  are  ±1  and  satisfy  abed  =  -1. 
For  t;  =  4  the  group  G  can  be  non-cyclic.  If  G  is  cyclic,  then  we  get  ordinary  ±1  sequences 
of  length  4  which  are  special  cases  of  “Barker  sequences”,  “Golay  sequences”,  “Shapiro 
sequences”,  “PONS  sequences”,  etc.  If  ^  3  and  D  is  a  singleton,  again  5^  ^  0  if  and  only 
if  V  —  3  or  ?;  =  4,  and  we  once  again  obtain  the  same  (Gauss)  sequences  of  length  3  and  the 
same  ±1  functions  as  above. 

We  now  come  to  the  case  when  D  is  a  non-trivial  difference  set,  and  this  is  the  only  non¬ 
trivial  part  (!)  of  this  section  3.5.  From  assertion  c)  in  the  above  theorem  it  easily  follows  by 
elementary  calculations  that  5^  intersects  the  unit  circle  if  and  only  if 


(3.29) 


v-~A{k-X)<,d. 
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On  the  other  hand,  upon  choosing  a  =  1  and  =  -1  in  Theorem  3.1,  we  infer  from  (3.19), 
(3.20)  and  (3,23)  that,  for  any  (u,  k,  A)  difference  set  with  u  ^  3, 

(3.30)  v~4{k-X)^ - — 

and  therefore  that  v  -  4(/c  -  A)  >  -1,  since  the  left  side  of  (3.30)  is  an  integer.  Thus  (3.29) 
can  be  satisfied  in  only  two  cases: 

(3.31)  Casel.  v  -  A{k  -  X)  = -1 

(3.32)  Case  2.  -  4{k  -  A)  =  0. 

Before  studying  theise  two  {very  important)  cases,  let  us  first  clarify  some  terminology.  Many 
authoritative  experts  (such  as  [2]  and  [18],  for  example)  call  (u,  fc,  A)  difference  sets  satisfy¬ 
ing  (3.31)  *‘Hadamard  difference  sets”  and  those  satisfying  (3.32)  “Menon  difference  sets”. 
Other  authoritative  experts  call  those  satisfying  (3.32)  “Hadamard  difference  sets”  and  those 
satisfying  (3.31)  “Hadamard-Paley  difference  sets”.  All  these  choices  of  names  are  histor¬ 
ically  justified,  yet  this  discrepancy  is  unfortunate  and  confusing.  So  I  humbly  propose  to 
adopt  the  following  choices  (and  make  everyone  happy): 

Definitions.  Difference  sets  satisfying  (3.3])  will  be  called  ''Hadamard-Paley  difference 
sets”.  Those  satisfying  (3.32)  will  be  called  "Hadamard-Menon  difference  sets”. 


Let  us  now  study  these  two  types  of  difference  sets  and  their  incidences  on  our  Problem 
B).  Elementary  arithmetical  calculations  show  that  (assuming  k  ^  v/2  without  loss)  the 
diophantine  identities  (3.15)  and  (3.31)  hold  simultaneously  if  and  only  if  the  parameters 
(u,  A;,  A)  of  the  (Hadamard-Paley)  difference  set  have  the  form 

(3.33)  u  =  4ti  —  1,  k  =  2n  —  l,  A  =  n  —  1 

and  that  the  diophantine  identities  (3.15)  and  (3.32)  hold  simultaneously  if  and  only  if  the 
parameters  (v^k,X)  of  the  (Hadamard-Menon)  difference  set  have  the  form 

(3.34)  ^;  =  4A^^  k  =  2N'^-N,  X  =  N'‘-N. 


Now  some  elementary  calculations  show  that  if  D  is  a  (right,  say)  Hadamard-Paley  dif¬ 
ference  set  as  defined  by  (3.33),  then  the  only  unimodular  solutions  P  £  C  of  our  above 
Problem  B)  are 


(3.35) 


13  =  exp 


^  arccos 


^  =  exp  ^ 


i  arccos 


l-2n\ 
l  +  2n  ) 


and,  of  course,  the  complex  conjugate  of  this  number.  Also  elementary  calculations  show 
that  if  D  is  a  (right,  say)  Hadamard-Menon  difference  set  as  defined  by  (3.34),  then  the  only 
unimodular  solution  ^  G  C  of  our  above  Problem  B)  is  ^  =  -1. 
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Also  observe  that  all  the  solutions  of  Problem  B)  in  the  case  of  trivial  difference  sets  are 
special  cases  of  the  above  discussion  (for  Hadamard-Paley  and  Hadamard-Menon  difference 
sets),  except  for  the  Gaussian  sequences  (1,  ±i)  of  length  2.  Let  us  now  recapitulate: 

Theorem  3.3.  Let  G  be  any  finite  (not  necessarily  abelian)  finite  group  of  order  v  ^  2, 
with  neutral  element  e.  Let  D  C  G,  (D  0),  be  a  (right)  {v,  fc,  A)  difference  set  (satisfying 
k  ^  v/2  without  loss).  Define  a  binary  unimodular  function  F  :  G  C  by  F(x)  =  I  if 
X  e  D  and  F(^x)  =  if  x  e  G\D  where  eC  and \l3\  =  1.  Then  the  (right)  autocorrelation 
function  7^  satisfies  7p(u)  =  0  for  all  u  €  G\{e}  if  and  only  if  we  are  in  one  of  these  three 
situations: 

Si)  V  =  2  and  F  is  one  of  the  (Gauss)  sequences  (1,  i)  and  (1,  —  0- 

b)  D  is  a  Hadamard-Paley  difference  set  given  by  (333)  and  ^  is  defined  by  (335)  or  is 
the  complex  conjugate  of  that  number. 

c)  D  is  a  Hadamard-Menon  difference  set  given  by  (3.34),  and  =  —1. 

3.6.  Binary  bi-unimodular  sequences 

The  case  of  binary  bi-unimodular  sequences  is  just  the  special  case  of  the  discussions 
and  results  of  section  3.5  when  the  group  G  is  cyclic.  The  extension  of  the  result  (3.12)  of 
Bjorck  [6],  promised  in  section  3.2,  is  the  special  case  of  Theorem  3.3  when  G  is  cyclic. 

This  now  leads  us  to  the  crucial  problem  of  the  existence  and  determination  of  such 
Hadamard-Paley  and  Hadamard-Menon  difference  sets.  Even  in  the  case  of  cyclic  groups 
these  existence  problems  are  far  from  being  entirely  solved.  Several  classes  of  cyclic  Hada¬ 
mard-Paley  difference  sets  are  known.  See  for  example  [2]  and  [18].  As  for  cyclic  Hadamard- 
Menon  difference  sets,  no  example  is  known  (except,  of  course,  for  v  =  4),  and  a  long¬ 
standing  conjecture  (due  to  Ryser  [33])  is  that  none  exists  if  t;  >  4.  This  is  equivalent  to 
the  famous  ''Hadamard  circulant  conjecture'"  which  states  that  no  circulant  matrix  of  order  v 
with  entries  ±1  can  be  a  Hadamard  matrix  unless  v  =  4. 

The  interpretation  of  binary  bi-unimodular  sequences  in  terms  of  polynomials  of  degree 
V  —  I  (with  binary  unimodular  coefficient  sequences),  which  are  perfectly  flat  on  the  group 
Ty  of  ?;-th  roots  of  1,  is  by  now  obvious.  We  will  come  back  to  these  in  some  depth  in  [39]. 

3.7.  Sup-norms  of  bi-unimodular  polynomials  on  the  unit  circle 

A  '"bi-unimodular  polynomial"  is,  of  course,  the  associated  polynomial 

L-l 

(3.36)  P(X)  =  ^a*X* 

A:=0 

of  a  bi-unimodular  sequence  (gq,  oi,  •  •  •  5 

A  useful  theorem  of  Landau  [21]  says  that  if  Q{X)  —  ^  C[A']  is  any  poly¬ 

nomial  with  complex  coefficients,  then  its  maximum  modulus  ||QI|oo  on  the  whole  unit  circle 
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is  majorized  in  terms  of  its  maximum  modulus  on  the  group  Tl  of  L-the  roots  of  1  as  follows: 

(3.37)  IIQIIcx,  <  C  -  (mMlQ(p)l)  -logL 

where  C  is  some  absolute  constant,  and  the  log  L  cannot  be  replaced  (in  the  general  case)  by 
a  smaller  factor.  This  can  be  proved  by  Lagrange  interpolation,  and  also  otherwise. 

Since  our  bi-unimodular  polynomial  (3.36)  satisfies  |P(p)|  =  VZ  for  all  5  G  Fl,  (3.37) 
implies  that 

(3.38)  ||P||oo<C%/IlogL. 

After  Bjorck  [6]  gave,  at  the  1989  ASI  on  '^Recent  Advances  in  Fourier  Analysis'’,  his  lec¬ 
ture  on  (what  we  subsequently  called  the)  bi-unimodular  sequences,  one  of  the  participants 
(perhaps  H.  S.  Shapiro  or  D.  J.  Newman,  if  my  memory  is  correct)  conjectured  that,  for 
bi-unimodular  polynormdXs  P{X),  (3.38)  could  be  improved  to: 

(3.39)  ||P|U  ^  C,Vl  (Cl  —some  absolute  constant), 

Le.,  P{X)  has  bounded  crest  factor  (see  beginning  of  part  3).  His  “evidence”  for  the  conjec¬ 
ture  (3.39)  was  twofold: 

A)  On  one  hand  the  conjecture  (3.39),  in  the  special  case  of  the  Gauss  sequences 

(3.40)  Uk  =  0)“^'+*’*’ 

(L  odd,  w  =  a  relatively  prime  to  L),  had  been  around  for  many  decades, 

perhaps  even  before  the  1920’s  and,  incidentally,  is  still  open  in  2000-2001. 

B)  On  the  other  hand  (3.39)  was  known  to  be  true  for  two  particular  classes  of  bi-uni¬ 
modular  sequences/polynomials,  namely  the  Gauss  sequences  (3.40)  with  the  choices 
a  =  {L  ±l)/2  (see  Littlewood  [22])  and  the  bi-unimodular  polynomials  (of  length 
L  —  M^)  studied  in  1977  by  Byrnes  [10]: 

M-l  M-1 

(3.41)  P(X)  =  Xm  e^hrin/M ^Mh+r ^ 

h=0  r=0 

I  did  some  thinking  on  the  conjecture  (3.39)  from  time  to  time,  until  I  saw  how  to  disprove 
it  in  June  2000: 

Theorem  3.4.  Ifp  is  a  prime  =  —l{mod4)  and  ak  is  the  binary  bi-unimodular  sequence 
of  length  p  (originally  introduced  by  Bjorck)  defined  by 


(.  I-P 

Ok  =  exp  %  arccos  — — 

V  1+P 
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if  k  is  a  quadratic  non-residue  (mod  p)  and  —  I  otherwise,  (0  ^  k  ^  p  —  1),  then  the 
associated  (bi-unimodular)  polynomial  satisfies,  for  all  sufficiently  large  p, 

(3.42)  llPlIoc,  >  -v^loglogp  -  I 

TT  Z 

and,  ife>0  is  fixed,  then  for  infinitely  many  such  primes  p 

(3.43)  ||P||oo  >  0  •  -  ej  v^loglogp 

where  7  is  Euler's  constant. 

These  results  are  immediate  consequences  of  Montgomery’s  1980  work  [28]  on  the  Fekete 
polynomials  ^  (p)  where  X  is  Legendre’s  symbol.  It  is  unforgivable  that  I 
have  not  seen  earlier  this  straightforward  disproof  of  conjecture  (3.39),  since  Montgomery 
had  given  me  a  copy  of  his  paper  [28]  as  early  as  1988. 

This  disproof,  in  turn,  gives  rise  to  interesting  results  and  problems  on  binary  bi-unimod¬ 
ular  sequences,  which  I  hope  to  discuss  in  [39]. 

4,  The  drunkard,  the  bar  owner  and  the  police 

Sorry,  dear  reader,  I  must  abruptly  stop  my  write-up  at  this  point.  I  have  a  deadline  to 
respect  and  I  had  underestimated  the  time  and  space  needed  to  just  touch  upon  some  most 
interesting  topics  I  had  in  mind  when  writing  the  introduction.  Below  is  a  list  of  some  of  these 
topics,  and  I  am  also  omitting  some  of  them,  with  the  intention  of  including  most  of  them  in 
the  extended  version  [39]  of  this  paper  (which,  in  a  way,  I  have  barely  started  to  write). 

4.1.  Some  topics  to  be  found  in  the  extended  version  [39] 

•  The  genesis  of  the  analytic  theory  of  flat  polynomials  from  nineteenth  century  Fourier 
analysis.  The  origins  from  the  absolute  convergence  of  trigonometric  and  Fourier  se¬ 
ries:  Dirichlet,  Fej^r,  Bernstein  and  many  others  up  to  the  present  days. 

•  The  various  Hardy-Littlewood  unimodular  polynomials  with  bounded  crest  factors. 
Their  connections  with  number  theory  and  Fourier  analysis. 

•  Bounded  crest  factor  properties  of  Gauss  and  Byrnes  polynomials  and  open  problems. 
The  van  der  Corput  methods  and  their  discrete  analogs  (Kuzmin,  Landau).  D.J.  New¬ 
man’s  method. 

•  D.  J.  Newman  and  H.  S.  Shapiro  in  New  York  in  the  late  1940’s.  The  making  of  the 
Shapiro  polynomials  at  M.I.T.  (1950-1951).  Raphael  Salem  and  the  Fekete  polynomi¬ 
als.  Infra-red  spectrometry  and  the  Golay  sequences.  History  of  Shapiro  polynomials 
and  Golay  sequences:  Rediscoveries,  truths,  lies,  counter-truths  and  folklore  galore. 

•  Barker  sequences  and  their  generalizations.  The  Hadamard  Circulant  Conjecture,  from 
Reiser  to  B.  Schmidt  via  R.  Turyn. 
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•  DJ.  Newman’s  conjecture  and  Barker  sequences.  Recent  (published  and  unpublished) 
research. 

•  Norms  of  unimodular  polynomials.  History  of  Erdos-Newman  conjectures.  Little- 
wood’s  work  and  his  various  conjectures  and  ''counter-conjectures"'.  The  strange  story 
of  the  Byrnes  pseudo-polynomials,  Komer’s  “true-false”  theorem,  Kahane’s  ultra-flat 
polynomials  and  further  on-going  research. 

•  Littlewood’s  conjecture  on  iZ-norms  of  exponential  sums:  Exciting  research  from 
Paul  Cohen  (late  1950’s)  to  Ivo  Klemes  (early  2000’s). 

4.2.  Time  to  stop 

The  above  list  is  incomplete.  It  is  pointless  to  try  to  remember  all  the  topics  that  I  had 
intended  to  put  in  this  paper,  as  I  have  to  stop  anyway.  I  deliberately  refrained  from  giving 
any  references  in  the  above  section  4.1,  as  there  is  just  too  much  exciting  work  (whether  old, 
recent,  unpublished  or  on-going)  to  quote.  It  is  better  to  do  a  rather  thorough  job  in  [39]  than 
a  sloppy  job  here. 

I  find  myself  in  the  same  situation  as  a  drunkard  sitting  late  at  night  in  a  bar  with  a  (nearly) 
full  bottle  of  whiskey  he  intends  to  drink,  who  is  suddenly  told  by  the  bar  owner  that  it’s  now 
closing  time  and  that  the  police  are  about  to  arrive  any  second  and  enforce  the  regulations. 
The  drunkard  has  to  leave  the  bar,  and  I  have  to  stop  here.  By  the  time  this  (quasi-aborted) 
paper  appears,  I  hope  the  extended  version  [39]  will  be  available  for  whoever  is  interested. 
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Abstract.  This  is  a  survey  of  some  recent  results  on  the  phenomenon  of  the  “in¬ 
visible  spectrum”  for  Banach  algebras.  Function  algebras,  formal  povi^er  series  and 
operator  algebras  are  considered.  This  includes  a  quantitative  treatment  of  the  famous 
Wiener-Pitt-Sreider  phenomenon  for  convolution  measure  algebras  on  locally  com¬ 
pact  abelian  (LCA)  groups.  Efficient  sharp  estimates  for  resolvents  and  solutions  of 
higher  Bezout  equations  in  terms  of  their  spectral  bounds  are  considered.  The  smallest 
spectral  “efficiency  hull”  of  a  given  closed  set  is  introduced  and  studied.  Using  the 
spectral  hulls  we  define  uniformly  bounded  functional  calculi  for  elements  of  the  al¬ 
gebras  in  question.  This  program  is  realized  for  the  measure  algebras  of  LCA  groups 
and  for  the  measure  algebras  of  a  large  class  of  topological  abelian  semigroups;  for 
their  subalgebras — the  (semi)group  algebra  of  LCA  (semi)groups,  the  algebra  of  al¬ 
most  periodic  functions,  the  algebra  of  absolutely  convergent  Dirichlet  series,  as  well 
as  for  the  weighted  Beurling-Sobolev  algebras,  for  quotient  algebras,  and  for 
some  finite  dimensional  algebras. 
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Part  1.  Quantitative  version  of  the  Gelfand  theory 
1.  Introduction:  Efficient  Inversion 
1.1.  Basic  motivations 

There  are  three  classical  problems  of  harmonic  analysis  and  function  theory  related  to  a 
phenomenon  we  call  invisible  spectrum.  Formal  definitions  are  contained  in  Subsection  1.2 
below. 

The  first  problem  comes  from  convolution  equations  and  is  related  to  what  is  usually 
called  *‘the  Wiener-Pitt  phenomenon”.  Namely,  let  G  be  a  locally  compact  abelian  group 
(LCA  group)  written  additively,  and  M{G)  the  convolution  algebra  of  all  complex  measures 
on  G.  The  fundamental  problem  is  to  find  an  invertibility  criterion  for  measures  p  G  M{G), 
that  is,  a  criterion  for  the  existence  of  6  M.{G)  such  that  p  *  u  =  5o,  Sq  being  the  unit  of 
M.{G)  (the  Dirac  ^-measure  at  0).  An  obvious  obstruction  for  invertibility  is  the  vanishing 
of  the  Fourier  transform  p{'y)  =  0  at  a  point  7  of  the  dual  group  G,  since  the  equality  pi>  =  1 
is  equivalent  to  the  initial  convolution  equation.  Generally,  the  boundedness  away  from  zero 

(1.1)  5=  inf  |/x(7)|  >  0 

is  necessary  for  p  to  be  invertible.  N.  Wiener  and  R.  Pitt  (1938),  and  Yu.  Sreider  (1950,  a 
corrected  version  of  Wiener  and  Pitt’s  result)  discovered  that,  in  general,  this  is  not  sufficient. 
Namely,  there  exists  a  measure  p  on  the  line  R  whose  Fourier  transform 

(1.2)  dii(x),  j/  e  R 

JR 

is  bounded  away  from  zero  but  there  exists  no  measure  z/  G  A^(R)  such  that  z>(y)  =  '^/piv) 
for  y  G  R.  This  result  still  holds  true  for  an  arbitrary  LCA  group  G  which  is  not  discrete, 
see  [Rul],  [GRS],  [HR]  for  references  and  historical  remarks.  Using  the  Banach  algebra  lan¬ 
guage,  one  can  say  that  the  dual  group  G,  being  the  “visible  part”  of  the  maximal  ideal  space 

=  9!Jt(A)  of  the  algebra  A  =  Ad(T),  is  far  from  being  a  dense  subset  of  931.  Nonetheless, 
later  on  we  will  see  that  some  quantitative  precisions  of  (1 . 1),  namely  a  closeness  of  the  norm 
||/i||  and  5  of  (1.1),  lead  to  the  desired  invertibility,  and  even  to  a  norm  control  of  the  inverse. 

On  the  contrary,  for  a  discrete  group  G,  the  classical  Wiener  theorem  on  absolutely  con¬ 
vergent  Fourier  series  tells  that  condition  (1.1)  implies  the  invertibility  of  p,  and,  moreover, 

—  G.  However,  in  this  setting  too,  the  problem  of  the  norm  control  for  inverses  p~^  is  still 
meaningful  and  interesting,  in  spite  of  the  Wiener  theorem,  since  the  latter  does  not  yield  any 
estimate.  In  fact,  from  the  quantitative  point  of  view,  there  is  no  big  difference  between  these 
qualitatively  polar  cases  (we  mean  the  cases  of  nondiscrete  and  discrete  groups).  It  turns  out 
that,  in  both  cases,  one  can  control  the  norms  ||^“^||  for  (^  >  0  close  enough  to  the  norm  \\p\\, 
but  this  is  not  the  case  for  small  (5  >  0. 

The  second  problem  we  are  interested  in  is  to  distinguish,  among  all  unital  Banach  al¬ 
gebras  A,  those  permitting  an  estimate  of  the  resolvents  only  in  terms  of  the  distance  to  the 
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Spectrum.  More  precisely,  we  want  to  know  for  which  algebras  A  there  exists  a  function  (/? 
such  that 

(1.3)  ||(Ae-/)“*||  ^  v(dist(A,(T(/))) 

for  all  A  G  C\a{f)  and  all  /  G  A,  ||/||  ^  1.  Here  e  stands  for  the  unit  of  A,  and  a{f) 
for  the  spectrum  of  /  in  the  algebra  A.  We  treat  this  problem  in  a  more  general  context  of 
norm-controlled  functional  calculi,  that  is,  as  a  partial  case  of  the  norm  estimates  problem  for 
functions  operating  on  a  Banach  algebra.  See  Chapter  2  for  more  details. 

The  third  problem  is  a  multi-element  version  of  the  previous  two.  Postponing  precise 
definitions  and  discussions  to  Subsection  1 .2,  we  mention  here  the  classical  corona  problem 
for  the  algebra  (O)  of  all  bounded  holomorphic  functions  on  O,  an  open  subset  of  or 
of  a  complex  manifold.  Recall  that  the  problem  is  to  solve  the  Bezout  equations 

n 

(1.4)  = 

k=l 

in  the  algebra  where  the  data  fk  G  satisfy  an  analogue  of  condition  (1.1), 

(1.5)  = 

k=l 

and  to  estimate  solutions  G  The  Banach  algebra  meaning  of  the  corona  problem 

is  well  known;  namely,  the  existence  of  solutions  for  any  data  satisfying  (1.5)  is 

equivalent  to  the  density  of  the  “visible  part”  of  the  spectrum  ifl  ~  in  971. 

In  what  follows,  we  consider  a  norm  refinement  of  this  problem  for  several  algebras  different 
from 

1.2.  (In)Visibility  levels.  Main  problems 

Let  ^  be  a  commutative  unital  Banach  algebra  and  be  a  subset  of  the  maximal  ideal 
space  of  A  (the  spectrum  of  A),  9Jt  =  9H(A),  X  C  971(^4).  We  write  /  i — )■  f  (m)  for  m  G  M, 
or  simply  /  i — >  /(m),  for  the  Gelfand  transform  of  an  element  f  £  A,  and  hence,  staying  on 
X,  we  can  embed  A  into  C'(X),  /  i — >  f{x)  =  5x{f)  for  x  e  X,  where  (5^;  G  97t  stands  for 
the  point  evaluation  5x(f)  =  f{x),  f  e  A.  In  what  follows,  we  regard  closX  as  the  visible 
part  of  971. 

Recall  that  the  spectrum  f7(/)  of  an  element  /  G  A  coincides  with  the  range  /(971)  of  the 
Gelfand  transform.  The  following  definition  formalizes  different  levels  of  “visibility”  of  the 
spectrum. 

Definition  1.2.1.  The  spectrum  of  A  is  called  n-visible  (or,  n-visible  from  X),  n  = 
1,2,.. .,  if  /(971)  =  clos  (/(X))  for  all  n-tuples  /  —  (/i, . . . ,  /„)  G  A”  =  A  x  . . .  x  A;  and 
it  is  called  completely  visible  if  971  =  clos  X. 
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It  is  clear  that  (n  +  1) -visibility  implies  n-visibilty  for  any  n  ^  1,  and  the  complete 
visibility  is  equivalent  to  n-visibility  for  all  n  ^  1.  Moreover,  the  Gelfand  theory  of  maximal 
ideals  makes  evident  the  following  lemma. 

Lemma  1.2.2.  Fora  commutative  unital  Banach  algebra  A,  the  following  properties  are 
equivalent. 

(i)  The  spectrum  of  A  is  n-visible. 

(ii)  For  every  /  =  (/i, . .  • ,  /n)  e  satisfying 

n 

(1.6)  =:  inf  y^|/A:(i)|^  >  0, 

fc=:l 

there  exists  an  n-tuple  g  e  A^  solving  the  Bezout  equation 

n 

(I-V)  'Z,gkfk  =  e. 

k=l 

In  this  language,  the  Wiener-Pitt  phenomenon  is  exactly  the  1 -invisibility  of  the  spectrum 
for  the  measure  algebra  M{G),  if  we  stay  on  the  dual  group  X  —  G.  Rigorously  speaking, 
we  mean  the  algebra  of  Fourier  transforms  TM{G)  =  {ft  :  p  e  M{G)}  endowed  with  the 
norm  ||/i||  =  \\p\\  and  embedded  into  C{G).  In  what  follows,  we  systematically  identify  these 
algebras. 

The  next  definition  specifies  the  previous  one  in  the  case  of  norm  controlled  invertibility 
instead  of  simple  invertibility. 

Definition  1.2.3.  Let  A  and  X  be  as  above,  and  let  0  <  ^  1.  The  spectrum  of  A 

is  called  ((5  —  n) -visible  (from  X)  if  there  exists  a  constant  Cn  such  that  any  Bezout  equation 

(1.7)  with  data  /  =  (/i,  ...,/„)  G  vl"  satisfying  (1.6)  and  the  normalizing  condition 

(1-8)  II/IP’-EIIMNi 

k=l 

has  a  solution  g  e  with  ||^||  c^.  The  spectrum  is  called  completely  5-visible  if  it 

is  —  n)-visible  for  all  n  1  and  the  constants  Cn  be  chosen  in  such  a  way  that 

Cn  <  00. 

Clearly,  there  exist  the  best  possible  constants,  in  the  following  sense.  Setting 

n 

(1.9)  c„(i5)  =;  c^{S,A)  =  Cn{5,A,X)  =  sup{mf(||s||  :  y^gkfk  =  e,  g  6  A")}, 

^  k=\ 

where  the  supremum  is  taken  over  all  /  G  A"  satisfying  (1.8)  and  (1.6),  we  get  the  smallest 
number  for  which  Cn  —  c„(5)  +  e  meets  the  requirements  of  Definition  1.2.3  for  every  e  >  0. 
In  particular, 

(1.10) 


ci((5)  =  sup{||/  'll  :  /  e  A;  (S  ^  |/{a:)|  ^  ||/||  ^  1,  a:  6  X}, 
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and,  in  this  case,  we  can  take  Ci  =  Ci(5)  in  Definition  1.2.3;  here  and  in  what  follows  we 
formally  set  ||/~^||  —  oo  for  noninvertible  elements  of  A. 

We  define  the  n-th  critical  constant  Sn{A^  X)  by  the  relation 

5n{A,X)  -  inf{(^ :  Cn{S,A,X)  <  00} 

1.2.4.  Main  problems.  Our  main  objective  is  to  estimate  from  above  and  from  below, 
and  (if  possible)  to  compute  the  critical  constants  6n[A^  X)  and  the  majorants  c„((5,  A,  X)  for 
basic  Banach  algebras  A  and,  thus,  to  study  norm  controlled  visibility  properties  for  these 
algebras. 

For  n  =  1  we  deal  with  more  general  objects.  Namely,  we  are  interested  in  describing 
functions  boundedly  acting  on  a  given  algebra.  Supposing  that  a  “visible  part”  of  the  spectrum 
is  fixed,  X  C  991(A),  we  can  say  that  a  function  (say,  cp)  defined  on  <7  C  C  acts  on  an  algebra 
A  if  for  every  a  G  A  with  d(X)  c  a  there  exists  b  e  A  such  ihdXipod  =  6  on  X.  A  “bounded 
action”  is  defined  in  the  same  way  but  adding  an  estimate  of  the  form  ||6||  <  A;||<^||=^,  where 
k  =  k{(j,  A,  X),  and  ||  •  ||*  is  an  appropriate  norm  majorazing  ||9?||^  —  sup^  \ip\.  We  mention, 
however,  that  such  a  definition  is  too  broad  to  be  useful:  the  spectral  inclusion  d{X)  c  a 
alone,  without  any  norm  restrictions,  cannot  imply  the  boundedness  of  compositions.  We 
formalize  this  statement  as  follows. 

Lemma  1.2.5.  Assume  that  |a(a;)|  ^  5{x  G  X)  always  implies  ||a“^||  <  C,  for  some 
positive  6  and  C.  Then  A  is  a  uniform  algebra  whose  norm  is  equivalent  to  the  sup  norm  on 
X:  \\d\\x  ^  ||a|l  <  {2C-{-S)\\d\\xforallaeA. 

The  lemma  shows  that  certain  norm  requirements  are  necessary.  The  classical  results 
on  functions  operating  on  Fourier  transforms  show  many  specific  examples  of  this  kind,  see 
[HKBCR],  [Rul]  and  further  references  therein. 

In  paper  [N4],  adding  the  normalizing  condition  ||a||  ^  1  to  the  spectral  inclusion  d{X)  C 
a  we  look  for  a  “minimal  spectral  hull”  h{a)  =  h{a,  A,  X)  such  that  functions  holomorphic 
on  h{a)  boundedly  act  on  a  given  algebra.  Our  approach  to  this  problem  is  explained  in 
Section  2. 

1.3.  Outline  of  the  theory 

The  main  goal  of  the  theory  presented  below  is  to  estimate,  from  above  and  from  below, 
and  (if  possible)  to  compute,  the  critical  constants  (5„(A,  A)  and  the  majorants  A,  X) 
for  some  commutative  Banach  algebras  frequently  used  in  harmonic  analysis  and  for  the  cor¬ 
responding  (customary)  visible  parts  X  of  their  spectra.  The  basic  algebras  are  the  following 
ones: 

(/)  the  measure  algebra  M{G)  on  an  infinite  LCA  group  G  with  X  —  G,  and  in  partic¬ 
ular,  the  Wiener  algebra  W  —  (Z)  of  absolutely  convergent  Fourier  series  with  X  =  T 

(Section  5); 
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(ii)  the  analytic  Wiener  algebra  with  X  =  D  =  {2  €  C  :  |^|  ^  1} 

(Section  5); 

(Hi)  the  weighted  Beurling-Sobolev  algebras  of  positive  spectral  radius,  both  analytic 
(with  X  —  ID),  and  “symmetric”  !Fl^(Z,w)  (with  X  —  T),  for  some  class  of 
regularly  growing  weights  w{n)  tending  to  00  as  |n|  — >  00  (Section  7). 

In  Section  2,  following  [N4],  we  introduce  two  general  methods:  a  method  for  upper 
estimates  of  the  visibility  constants  Cn{S^  X),  and  a  method  for  their  lower  estimates.  The 
method  for  lower  estimates  for  c„((5,  A,  X)  described  in  Subsection  2.3  refers  to  elements  of 
A  with  “almost  independent”  powers.  In  Section  5  we  specify  this  method  for  the  algebras 
M.{G),  M{S)  using  the  Sreider  measures,  which  are  defined  as  measures  with  real  Fourier 
transforms  and  the  spectrum  filling  in  a  disc.  In  particular,  a  short  proof  is  given,  following 
[N4],  to  the  fact  that  (^i(W+,  ID)  =  1/2  and  Ci((^)  =  [26  -  1)"^  for  1/2  <  S  ^  I  and  ci(6)  — 
00  for  5  ^  1/2.  A  different,  longer  but  more  elementary  proof  of  this  fact  is  contained 
in  [ENZ].  Independently,  another  elementary  proof  of  the  equality  (5i(FF|.,D)  =  1/2  was 
recently  presented  by  H.S.  Shapiro  at  the  6th  St.  Petersburg  Summer  Analysis  Conference, 
see  [Sh2]. 

In  the  same  Section  5,  we  show  that  ci(S,XM{G),G)  =  cxd  for  0  <  5  ^  1/2,  and 
thus  6i{J^M(G),G)  ^  1/2,  for  every  infinite  LCA  group  G.  Moreover,  (26  —  1)“^  ^ 
ci{S,J^M{G),G)  ^  Cn{S,TM(G),G)  for  1/2  <  S  ^  1,  and  Cn{S,  TM{G),G)  ^  (26^  - 
1)“^  for  I/V2  <6^1.  Again,  these  results  are  contained  both  in  [N4]  and  [ENZ]  but  the 
proofs  are  different. 

It  should  be  noted  that  some  of  these  results  were  mentioned  even  in  Shapiro’s  paper  [Shi] 
(Remarks  2  and  3,  and  a  footnote  on  page  235  of  [Shi]),  where  they  were  attributed  to 
Y.  Katznelson  (for  Ci{6,W,J)  =  00  for  ^  <  1/2),  to  Y.  Katznelson  and  D.  J.  Newman 
(for  Cl  (6,  VF,  T)  <  00  for  >  1  /  VS),  and  to  Bell  (for  Ci  (6,  VF+ ,  1)  <  cx)  for  5  >  1  /2),  but  at 
present  we  cannot  specify  references. 

Section  6  contains  some  results  on  the  efficient  inversion  on  some  finite  groups  and  semi¬ 
groups  (again,  following  [N4]). 

In  Section  4,  we  deal  with  a  more  general  framework,  namely  with  the  norm-controlled 
functional  calculi  and  the  so-called  spectral  efficiency  hulls  h{a,  A,  X)  introduced  (following 
[N4])  in  the  same  section.  The  links  of  efficiency  hulls  with  the  norm-controlled  calculi  are 
established  and  a  description  of  these  hulls  for  the  measure  algebra  Xf(S)  on  a  semigroup  S 
is  given  using  the  horodisc  expansions. 

The  results  and  the  methods  employed  for  weighted  convolution  algebras  are  completely 
different.  In  Section  3,  following  [ENZ],  a  general  method  is  developed  that  allows  us  to 
control  the  inverses  in  terms  of  S  —  inffm{/i)  1^1  for  rotation  invariant  topological  Banach  al¬ 
gebras  A.  More  precisely,  our  goal  in  Section  3  is  to  give  a  method  for  proving  the  equality 
Sl{A,^J0l)  =  0.  This  method  relies  on  two  ideas.  First,  to  estimate  ||:c“^||^,  we  use  the  mul¬ 
tipliers  mult  (VA)  of  the  space  VA  of  derivatives  of  our  algebra  A,  where  D  =  2^.  The 
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second  idea  is  to  deduce  estimates  from  the  compactness  of  the  embedding  A  c  mult  {VA). 
In  applications,  the  main  point  is  precisely  in  the  proof  of  this  compactness  and  in  estimat¬ 
ing  the  rate  of  decay  of  relevant  best  polynomial  approximations  in  the  mult  {VA)  norm. 
Following  [ENZ],  we  realize  this  approach  in  Section  7  for  the  Beurling-Sobolev  algebras 
A  =  lP(w)  of  positive  spectral  radius,  r{A)  =  lim„  >  0,  both  on  Z  and  Z+. 

Historical  remarks.  As  is  already  mentioned,  the  prehistory  of  the  ideas  presented  in 
this  paper  was  started  with  the  classical  theorems  of  Wiener-Levy  and  Wiener-Pitt-Sreider 
quoted  above. 

The  second  wave  of  results,  sharpening  the  Gelfand  and  Riesz-Dunford  functional  calculi, 
was  devoted  to  functions  operating  on  Fourier  transforms,  and  was  mostly  due  to  H.  Helson, 
J.-P.  Kahane,  Y.  Katznelson,  and  W.  Rudin.  The  main  problem  considered  and  resolved  was 
to  describe  functions  tp  defined  on  an  interval  a  =  [a,b]  C  R  and  such  that  p{J^f)  G  TA 
for  every  f  e  A  with  J^f(G)  C  a,  where  A  =  M{G)  or  A  =  L\G).  See  [HKKR], 
[Rul],  [GMG],  [Kl],  [HR]  for  exhaustive  presentations  and  further  references.  Nonanalytic 
functions  operating  on  certain  weighted  algebras  of  Fourier  transforms  occurred 

in  the  papers  of  J.-P.  Kahane,  [K2],  and  N.  Leblanc  [L].  However,  no  quantitative  aspects 
similar  to  those  of  Sections  1-2  were  explicitly  presented. 

The  third  wave  of  results  related  to  norm-controlled  calculi  can  be  linked  with  construc¬ 
tive  proofs  of  the  Wiener-Levy  theorem  on  inverses.  We  mention  the  proofs  by  A.  Calderon, 
presented  in  [Z],  by  P.  Cohen  [C],  and  by  D.  Newman  [New].  The  Calderon  approach  was 
developed  by  E.  Dyn’kin  [D]. 

The  problem  of  norm-controlled  inversion  (for  the  Wiener  algebra  A  =  l^(Z))  was  first 
mentioned  by  J.  Stafney  in  [St],  where  the  existence  of  a,  b,K  >  0  was  proved  such  that 
sup{||/"^|U  :  ll/ll  <  kJ{T)  C  [a,b]}  =  00.  This  implies  that  Ci (5,  A,  T)  =  oo  fox  some 
(5  >  0.  The  proof,  based  on  Y.  Katznelson’s  results,  does  not  permit  to  specify  the  value  of  6. 
Independently,  and  in  a  more  constructive  way,  the  result  was  obtained  by  H.  Shapiro  [Shi] 
(in  response  to  a  question  by  a  physicist  G.  Ehrling),  but  also  without  any  concrete  value  of  (5. 
Several  remarks  were  made  in  Shapiro’s  note  attributing  to  various  authors  certain  estimates 
for  quantities  we  call  the  critical  constants  5i(/^(Z+),D)  and  (5i(/^(Z),T);  no  precise  refer¬ 
ences  were  given.  In  fact,  the  paper  [N4]  was  inspired  by  Shapiro’s  construction.  In  the  paper 
of  J.-E.  Bjbrk  [B],  a  problem  related  to  uniform  functional  calculi  was  considered.  In  our  lan¬ 
guage,  it  is  equivalent  to  an  estimate  for  (5f  (A,  Sai(A)),  a  microlocal  version  of  Si  (A,  911(A)) 
studied  in  Section  4  below.  In  [B],  a  criterion  was  given  in  terms  of  another  quantity  which 
can  be  regarded  as  a  “uniform  spectral  radius”  of  the  algebra.  O.  El-Fallah  [E]  recently  gave 
an  application  of  Bkdrk’s  result  to  the  algebras  P{w)  with  slowly  growing  w. 

In  the  paper  of  S.  Vinogradov  and  A.  Petrov  [VP],  a  description  was  given  of  Banach 
spaces  A  of  functions  on  T  satisfying  the  following  property:  f  e  A  and  |/|  >  <5  on  T  imply 
1/f  e  A. 
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We  finish  these  remarks  mentioning  that  the  case  of  higher  Bezout  equations  (the  “corona 
problems”)  related  to  the  constants  Cn(<5,  A,  X)  and  X)  forn  >  1  is  considered  in  this 
paper  very  briefly.  For  their  history  see  [Gar],  [Nl],  as  well  as  [Tol]  and  [To2]. 

2.  How  and  Why  One  Can(not)  Control  the  Inverses 

Let  A  be  a  commutative  Banach  algebra  with  unit  e,  and  let  X  be  a  Hausdorff  topological 
space  such  that  A  is  continuously  embedded  into  C(X),  so  that  X  C  We  also  use 

other  notation  introduced  in  Section  1.  The  set  of  invertible  elements  of  A  is  denoted  by 
Q{A).  We  start  with  simple  observations  on  the  critical  constant  (^i. 

2.1.  First  observations 

We  say  that  A  is  an  algebra  of  distance  controlled  resolvent  growth  if  there  exists  a  mono¬ 
tone  decreasing  function  f) :  —  (0,  co)  such  that 

(2.1)  ||(Ae-/)~^ll  <  9?(dist(A,cr(/)),  XeC\a{f) 

for  all  f  e  A,  ll/ll  ^  1.  It  is  easy  to  see  that  this  estimate  implies  ||(Ae  -  f)~^\\  ^ 
i^(/?(i^dist  (A,(7(/)))  for  a\\  f  e  A  and  A  G  C\a(/).  It  is  shown  in  [N4]  that  a  com¬ 
mutative  Banach  algebra  A  obeys  the  distance  controlled  resolvent  growth  if  and  only  if 
(5i(A,9?t(y4))  =  0  (the  critical  constant  introduced  in  Section  1).  In  fact,  Ci(5j  A,9Jt)  ^ 
(p{6)  ^  Cl (5(2  -1-  5)"\  A,  Dm).  Similar  equivalences  hold  for  the  constant  5i(^,  AT)  related 
to  a  subset  X  C  ^{A),  and  for  (5  -  n)-visibility  and  the  “n-resolvent”  (Ae„  -  /)“\  where 
Ae„  —  (Aic, . . . ,  Anc)  G  .4",  A^  G  C. 

For  many  examples  of  function  Banach  algebras  with  various  behaviour  of  constants  5„ 
and  Cn(5),  including  the  classical  ones  (like  A  =  we  refer  to  [N4],  [ENZ]. 

2.2.  Splitting  X  -symmetric  algebras  for  upper  estimates 

As  before,  we  consider  a  commutative  unital  Banach  algebra  A  continuously  embedded 
into  the  space  C{X),  where  X  C  ilK(A).  Now,  we  need  the  following  definition,  see  [N4]. 

Definition  2.2.1.  We  say  that  an  algebra  A  splits  at  the  unit  if  there  exists  a  subspace 
Ao  C  A  such  that  A  =  e  •  C  +  Aq  (a  direct  sum)  and  for  /  =  Ae  -I-  /o,  /o  E  Aq  we  have 
ll/ll  =  |A|  +  ||/o||. 

An  algebra  A  is  said  to  be  X -symmetric  if  for  every  /  G  A  there  exists  an  element  g  e  A 
such  that  \\g\\  ^  |1/||  and  g{x)  -  f(x)  for  all  x  e  X. 

Obviously,  the  splitting  property  is  satisfied  by  the  algebras  A  obtained  by  the  standard 
adjoining  of  unity  to  a  Banach  algebra  Aq  without  unit.  For  instance,  this  is  the  case  for  the 
group  algebra  Aq  ~  L^(G)  of  a  nondiscrete  LCA  group  G.  Also,  it  should  be  mentioned 
that  the  classical  symmetry  property  of  Banach  algebras  corresponds,  in  our  language,  to  the 
9Jl(A)-symmetry.  For  X  9K(A),  AT-symmetry  may  happen  to  be  a  considerably  weaker 
property.  For  instance,  the  algebra  A  =  TM{G)  is  obviously  6-symmetric:  for  /  =  //  G  A 
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take  g  =  /I*,  where  //♦(cr)  =  fji{—cr),a  C  G.  But  it  is  not  SDt-symmetric  for  a  nondiscrete 
LCA  group  G.  This  latter  property  is  essentially  equivalent  to  the  Wiener-Pitt  phenomenon. 

For  X-symmetric  algebras,  the  ((^  —  n)-visibility  properties  are  related  to  each  other  as 
follows. 

Lemma  2.2.2.  [N4]  For  an  X -symmetric  algebra  A,  the  (S-1  )-visibility  of  the  spectrum 
implies  the  (VS  -  n) -visibility  for  all  n  ^  1,  and  even  the  complete  y/S- visibility  with 
snp„^^Cn(VS,A,X)  ^ci{5,A,X). 

Another  feature  of  X-symmetric  algebras  is  that  one  can  always  control  the  inverses  of 
elements  whose  lower  bound  6  =  infx  |/|  is  sufficiently  close  to  the  norm  ||/||.  To  show  this 
we  use  the  following  obvious  observation. 

Lemma  2.2.3.  Let  A  be  an  algebra  splitting  at  the  unit,  and  let  f  =  Xe  +  fo,  1/2  <6  ^ 
|A|  <  ll/ll  ^  1.  Then  f  is  invertible  in  A,  and  ||/“^||  ^  (2S  -  1)“^  | 

2.2.4.  X -domination  for  the  unit  evaluation  functional.  Let  (pe  be  the  following  func¬ 
tional  evaluating  the  coefficient  of  the  unit  in  the  standard  expansion  of  an  element  of  a 
splitting  algebra: 

(2.2)  V^e(Ae  +  /o)  =  A 

for  Xe  fo  e  A  =  Oe  Aq.  Note  that  (p^  is  a  norm  1  linear  functional  on  A,  not  necessarily 
multiplicative.  The  following  definition  will  be  useful. 

Definition  2.2.5.  Let  A  =  e-C-f  Ao  be  a  direct  sum  decomposition  of  a  Banach  algebra 
A,  and  let  X  C  dJl{A).  We  say  that  the  unit  evaluation  functional  (2.2)  is  X -dominated  if 
l¥’e(/)|  <  ll/iu  for  every  f  e  A,  where  ||/||x  =  sup;^  |/|. 

Standard  Hahn-Banach  arguments  show  the  following  lemma,  where,  as  above,  means 
the  evaluation  functional  at  a  point  x  e  X:  6x(f)  =  f{x)  for  f  e  A. 

Lemma  2.2.6.  The  following  assertions  are  equivalent. 

(i)  The  functional  (p^  of  (2.2)  is  X -dominated. 

(ii)  Pe  e  conv  (4  :  a:  6  X),  where  conv  (•)  stands  for  the  weak-*  closed  convex  hull  of 
(•)•■ 

Theorem  2.2.7.  Let  A  be  a  commutative  unital  Banach  algebra  and  X  c  271(A)  such 
that  (i)  A  is  X  -symmetric;  (ii)  A  splits  at  the  unit;  and  (Hi)  is  X -dominated. 

Then,  ±espectrumof  A  is  (S-n) -visible  for  all  n  ^  1  andall6  satisfying  l/y/2  <S  ^1, 
and  even  completely  S -visible  with  Cn(S,  A,  X)  ^  {26‘^  -  1)“^ 

Proof.  Let  /  =  (/i, . . . ,  /„)  e  A”  be  such  that  6  <  \  f{x)\  <  ||/||  ^  1  for  all  a;  e  X,  and 
let  Qk  be  elements  of  A  corresponding  to  the  fk  as  in  the  definition  of  X-symmetric  algebras. 
Then  h  €'"1,  where  h  =  Y2=ifk9k,  and  1/2  <  5‘^  ^  h{x)  ^  \\h\\  ^  1  for  all  x  €  X. 
Using  condition  (Hi),  Lemma  2.2.6,  and  an  obvious  fact  that  the  interval  1]  is  a  convex 
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set,  we  obtain  ^  ^  1-  Condition  (ii)  and  Lemma  2,2.3  imply  that  h  is  invertible  and 

||ft-'||  <  (2^^  -  1)"^.  Hence, g  =  . . .,g„h-')  e  A",  Y!k=\  fk(9kh~^)  =  e,  and 

Il9ll  =  <  llft-'ll  •  ll/ll  <  (2<5^  - 1)-', 

*=i 

as  desired.  | 


2.3.  A  method  for  lower  estimates 

The  method  to  get  a  lower  estimate  for  ci((5,  A,  X)  stated  in  theorem  2.3.1  below  is  in¬ 
spired  by  Shapiro’s  example  [Shi]  mentioned  above.  Essentially,  it  reduces  to  the  existence 
of  elements  a  €  A  whose  normalized  powers  a^/||a||^,  0  ^  k  <  p,  for  a  given  p  are  e- 
equivalent  to  the  standard  basis  of  an  -space,  whereas  asymptotically  they  tend  to  zero 
faster  than  a  given  exponential.  We  use  this  method  in  Section  5;  see  also  [ENZ]. 


Theorem  2.3.1.  [N4]LetA  be  a  unital  commutative  Banach  algebra,  X  C  ^[A),  and 
given  €>  0  let  A^  be  the  set  of  all  elements  a  e  A  such  that  ||a||c(X)  <  e  l|a||  =  1. 
Suppose  that 


(2.3) 


sup 


fc=0 


A:=0 


for  all  p  ^  0,  6  >  0,  and  ^  0.  Then  Ci((5,  A,X)  ^  {2S  —  1)  ^  for  all  S,  1/2  <  6  <  1.  In 
particular,  5i(24,X)  ^  1/2. 

The  proof  consists  of  setting  ft  =  {1  (e  —  ta)  for  t  >  0  such  that  (1  -E  >  (5  and 
1/2  <  5  <  1,  then  estimating  l|/r^||  froni  below  and  maximizing  in  t.  See  [N4],  theorem 
1.5.1,  for  details. 


3.  Estimates  of  Inverses  for  Rotation  Invariant  Algebras 

In  this  Section  we  describe,  following  [ENZ],  a  general  method  permitting  to  control 
the  inverses  in  terms  of  S  inf9jt(/i)  1^1  for  rotation  invariant  topological  Banach  algebras 
A  on  the  circle  T.  In  other  words,  our  goal  is  to  give  a  method  for  proving  the  equality 
(^1(24,  dfl)  =  0.  As  mentioned  above,  the  latter  property  is  equivalent  to  a  distance  controlled 
estimate  for  resolvents,  ||(Ae  -  ^  (p(dist  (A,  (7(0:)). 

In  short,  the  main  idea  how  to  estimate  ||,  a;  €  A,  is  to  “reduce  the  smoothness”  of 
by  applying  a  first  order  differential  operator  V,  and  then  to  use  the  formula  Vx~^  - 
-x^'^Vx.  To  estimate  the  norm  of  the  product  x~^Vx  we  use  the  range  norm  ||  •  Wva  on  VA 
and  the  multiplier  norm  related  to  VA,  namely  ^  ||^^||p/4|l^”^||muit(r>/i)- 

The  second  idea  is  to  rely  on  the  compactness  of  the  embedding  A  C  mult  iVA)  to  ensure 
a  uniform  estimate  for  ||a;“^||muit(x>A)-  We  obtain  an  estimate  for  Ci((5,  A)  measuring  the  size 
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of  a  compact  subset  of  mult  (VA)  in  terms  of  the  decreasing  rate  of  the  best  polynomial 
approximations,  see  Subsection  3.2  below. 

In  Section  7  we  apply  this  method  to  Beurling-Sobolev  algebras 

A  =  F{Z,w),F{’l+,w). 


3.1.  How  to  use  multipliers 

Having  in  mind  applications  to  the  case  A  =  F{w)  (Section  7),  from  now  on  we  dis¬ 
tinguish  between  the  Banach  algebras  and  the  algebras  that  become  a  Banach  algebra  after 
equivalent  norming.  More  precisely,  a  unital  Banach  algebra  is  a  Banach  space  A  endowed 
with  a  multiplication  such  that  |lxy||  ^  ||ar||  •  ||y||  for  all  x,y  e  A,  and  ||e||  =  1,  e  stands  for 
the  unit  of  A.  A  unital  topological  Banach  algebra  A  is  a  Banach  space  A  endowed  with  a 
continuous  multiplication  so  that  \\xy\\  <  C||rr||  •  ||y||  for  all  a;,  y  €  A  and  for  some  constant 
C.  Clearly,  every  commutative  topological  Banach  algebra  is  a  Banach  algebra  with  respect 
to  the  operator  norm  ||  •  |1*, 

l|x||*  =  sup{|lxi/l|  :yeA,  lls/ll  ^  1}. 

It  is  clear  that  some  properties  of  A,  such  as,  e.g.,  the  property  (5i(A,  X)  =  0,  are  renorming 
stable.  Some  others  may  be  greatly  affected  by  such  a  renorming.  For  instance,  so  is  the  sharp 
value  of  the  critical  constant  (A,  A)  if  it  is  positive,  or,  in  the  case  where  (A,  X)  =  0,  so 
is  the  growth  rate  of  the  constants  Ci{S,  A,  X)  as  5  — >  0. 

Now,  let  A  be  a  unital  topological  Banach  algebra  of  sequences  x  —  (Xn)  on  Z  or  Z4., 
which  means  that  x  1 — >  Xn  is  a  continuous  functional  for  every  n,  with  the  convolution  *  as 
an  algebra  operation,  and  such  that 

(i)  the  set  <So  of  finitely  supported  sequences  (on  Z  or  Z+,  respectively)  is  a  dense  subset 
of  A; 

(ii)  A  is  a  rotation  invariant  (homogeneous)  space  of  sequences,  that  is,  if  x  £  A  then 
Xt  =  {xnF)n  e  Aand\\xt\\  =  \\x\\  for  every  t  £  T. 

Conditions  (i)  and  (ii)  guarantee  that  the  rotation  t  \ — y  xt  is  a  norm  continuous  mapping 
from  T  to  A,  for  every  x  £  A.  Consequently,  the  Cesaro  (Fejer)  or  Abel-Poisson  averages  of 
the  series  x  ^Y^j^XkBk  converge  to  x  for  every  x  £  A\  here  Ck  =  (<^fcn)n  is  the  standard  0  —  1 
algebraic  basis  of  Sq, 

A  complex  homomorphism  of  A  is  uniquely  determined  by  its  value  A  =  p{e\)  on 
the  generator  ei  of  the  algebra  A,  and  the  Gelfand  (Fourier)  transformation  is  given  by  the 
formula, 

1 1— f  x(A)  =  =  Y^XkX’’, 

k 

at  least  for  x  £  Sq.  Hence,  p  \ — >  A  =  y?(ei)  is  a  bijection  of  9K(A)  on  a  compact  subset  of 
C.  We  identify  this  subset  with  9!lt(A).  In  can  be  easily  seen  that  9Jt(A)  =  A(r_,  r+)  =  {z  £ 
C  :  r_  ^  |2:|  <  r+},  where  r+  =  lim„  <  00  and  r_  =  liin„  ||e_n||~^/"  >  0.  We 
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refer  to  r±  =  r±(A)  as  to  the  lower  and  the  upper  spectral  radius  of  A.  The  normalized  case, 
where  =  1,  or  r+  =  r_  =  1  in  the  case  of  Z,  will  be  the  main  one  for  us. 

3.1.1.  A  Green  type  norm.  The  operator  V  is  defined  by  Vx  =  x  e  A.  \i  maps 

A  to  Sy  the  space  of  all  sequences  on  Z  (respectively,  on  Z+).  On  the  Gelfand  transforms 
X  —  the  operator  V  acts  as  Vx  =  {Vx)\  This  is  a  formal  first  order  differential 

operator,  V  —  z-^.  In  particular,  it  obeys  the  Leibnitz  rule  for  products  V{fg)  =  fVg-\-gV /, 
at  least  for  “trigonometric  polynomials”  f^g  ^  So .  Hence,  V{x  ^y)  =  x  *  Vy  y  ^  Vx  for 
every  x^y  e  <So.  In  fact,  the  same  formula  works  for  x  e  Soy  y  €  A,  and  in  particular, 
Vx~^  =  ~x~‘^  *  Vx  for  all  x  e  Q(A)  n  5o. 

The  range  VA  is  a  Banach  space  with  respect  to  the  range  norm  =  ||a:|U,  where 

a;  €  v4,  Xo  =  0.  For  convenience  reasons,  we  add  the  unit  e  =  eo  to  VA  and  will  consider 
the  sum  A'  =  VA  +  Ce  as  the  range  space  ofV  endowed  with  the  following  range  norm 
||Ae  +  Vx\\a>  =  ||Ae  +  x\\a,  where  a:  e  A,  a:o  =  0  and  A  €  C.  Clearly,  the  space  A'  contains 
So  as  a  dense  subset,  and  is  rotation  invariant. 

It  is  shown,  see  [ENZ],  that  A  C  A'  and  every  element  x  e  So  defines  a  continuous 
convolution  operator  y  \ — )■  x^y  on  A'.  Hence,  one  can  introduce  the  convolution  multiplier 
norm  on  finitely  supported  elements  a:  6  5o  as  the  operator  norm, 

lk||muit(A0  =  sup{||a;  *  y\\A>  :  y  e  So,  \\y\\A'  ^  !}■ 

By  definition,  the  space  mult  (A')  of  (little)  convolution  multipliers  of  A'  is  the  comple¬ 
tion  of  So  with  respect  to  this  norm.  Clearly,  mult  (A')  is  a  unital  rotation  invariant  Banach 
algebra. 

Lemma  3.1.2.  Let  A  be  a  topological  Banach  algebra  satisfying  the  above  conditions 
(i)-(ii),  and  letx  e  ^(A)  fi  ^(mult  (A'))  such  that  6  =  minAeOT(A)  |^(^)|  >  0-  Then 

<  l|e|Ui5"‘  +  2||x|U  • 

3.2.  How  to  use  compactness 

3.2.1,  A  multiplier  estimate.  Lemma  3.1.2  makes  evident  the  following  sufficient  condi¬ 
tion  for  —  l)-visibility  of  the  spectrum:  given  a  topological  Banach  algebra  A  of  sequences 
on  Z,  or  Z^.,  satisfying  conditions  (i)-(ii)  of  Subsection  3.1,  and  compactly  embedded  into 
mult  (A'):  A  Cc  mult  (A'),  then  ^i{A,  9J1(A))  =  0,  and,  moreover, 

Ci(5,A,lH)<r'||e|U  +  2C(iT,) 

for  all  >  0,  where  —  A^,  A^  =  {a;  C  A  :  ||a;||^  ^  1,  |:r{A)|  ^  S  for  all  A  G  9?1(A)}  and 
C{Ks)  is  defined  by  the  formula 

(3.1)  C{Ks)  =  sup{||a:-‘||^„„(^,,  :  x  e  Kg}  <  oo. 

It  remains  to  estimate  the  constant  C{Ks)  of  formula  (3.1). 
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3.2.2.  Approximate  characteristics  of  compact  sets.  We  use  the  standard  classification 
of  compact  sets  in  terms  of  the  best  polynomial  approximations.  Let  5  be  a  Banach  space 
and  let  Ln  C  B  he  subspaces  of  B  such  that  C  Ln+u  dimLn  <  oo  for  n  ^  1,  and 
clos  (U^  Ln)  =  B.  Further,  let  C  S  and 

(3.2)  en{K)  =  €n{K,  B)  =  supjdist  (x,  Ln)  :  x  e  K} 

be  the  best  approximations  of  K  by  elements  of  It  is  well  known  that  a  bounded  subset 
K  C  B  is  relatively  compact  if  and  only  (/’lim„e„(iC)  =  0.  We  apply  this  criterion  to 
the  space  B  =  mult  (A')  and  to  subspaces  ==  Vn  of  all  trigonometric  polynomials  of 
degree  less  than  or  equal  to  n.  Then  en{K)  of  (3.2)  are  the  best  polynomial  approximations 
of  elements  of  K.  The  axioms  (i)  and  (//)  of  Subsection  3.1  and  the  definition  of  mult  [A’) 
imply  that  lim^  -  x\\b  =  0  for  all  a;  G  B,  where  stands  for  the  Fejer  mean  of  a 
sequence  x  e  B,  Therefore,  we  can  use  the  above  compactness  criterion  for  B  =  mult  (A'). 

Now,  our  aim  is  to  specify  constants  C{Ks)  from  formula  (3.1)  in  terms  of  {Ks),  n  ^  1. 

3.2.3.  Calderon- Cohen-Dyn’kin  “constructive  inversions”.  As  mentioned  in  the  be¬ 

ginning  of  Section  3,  the  use  of  compactness  for  estimates  of  inverses  was  started  with  various 
“constructive  proofs”  of  the  classical  Wiener-Levy  inversion  theorem  for  absolutely  conver¬ 
gent  Fourier  series.  Here  “constructive”  means  “without  using  the  Gelfand  theory  of  maximal 
ideals”.  The  basic  A.  Calderon  proof,  [Z]  Chapter  VI,  theorem  (5.2),  exploits  the  Cauchy 
formula,  and,  thus,  is  applicable  not  only  to  inverses  x~^,  but  also  to  compositions  pox. 
P.  Cohen  [C]  used  the  same  techniques  for  inverses  and  for  higher  Bezout  equations.  In  [C],  a 
possibility  to  estimate  the  norms  ||vk  in  terms  of  =  infi-  \J^x\  and  certain  characteristics 
similar  to  the  quantities  of  (3.2)  was  mentioned  explicitly;  here  and  below  the  symbol  Tx 
is  used  for  the  Gelfand  (discrete  Fourier)  transform,  keeping  the  notation  x  for  the  classical 
Fourier  coefficients.  In  [D],  E.  Dyn’kin  applied  Calderon’s  method  to  the  Beurling  algebras 
A  =  /^(Z,  w)  to  get  estimates  for  the  inverses  in  terms  of  the  same  characteristics 

e„({a;}).  D.  Newman  [New]  gave  a  completely  elementary  proof  of  the  Wiener-Levy  theorem 
also  based  on  polynomial  approximations. 

All  authors  mentioned  above  obtained  some  estimates  for  the  inverses  assuming, 

or  implicitly  assuming,  that  x  runs  over  some  compact  subset  K  <Z  A.  The  approach  of  [ENZ] 
is  different:  we  prove  the  compactness  of  the  set  =  {x  G  A  :  ^  1,  \J^x\  5}  in 

the  algebra  mult  (A^)  and  use  this  compactness  for  obtaining  a  uniform  estimate  of 
for  X  G  As.  The  next  theorem  is  proved  in  [ENZ]  using  the  Dyn’kin  method  from  [D].  The 
latter  paper  contains  a  similar  result  for  the  special  case  of  the  algebra  B  =  with 

w{n)  —  w(—n)  and  r_  =  r+  ===  lim^  u;(n)^/”  —  1. 

Theorem  3.2.4.  Let  B  be  a  convolution  Banach  algebra  on  Z+,  or  on  Z,  satisfying 
conditions  (i)  and  (ii)  of  Subsection  3.1,  with  the  spectral  radius  r+ (respectively  spectral 
radii  r-  <  r+).  Let  K  be  a  relatively  compact  subset  of  Bs  =  {x  £  B  :  ||a:||j3  <  1,  and 
6  ^  \3^x{C)\  for  C  G  ^(B)}  and  let  A  =  Ax  be  the  distribution  function  of  the  sequence 
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XK{t)  -  card  {n  :  n  ^  1,  en-i(K)  >  ^},  t>  0. 


Then 

||x-ie  ^  M(S,X)  =  |^(r;^||e^||a  +  ri||e_,.|U)e-«/‘^W4) 

j^O 

for  every  x  £  K. 


For  the  case  of  an  algebra  on  Z  satisfying  r_  =  r+  “  1,  the  proof  starts  by  using  the 
Calderon  approach:  we  choose  a  polynomial  Ty  e  Vn,  n  -  X{5/4)  with  ||a:  -  vWb  ^  ^/4 
and  write  the  Cauchy  formula 

x~'^  =  (27ri)”^  /  (y  +  ze)~^{ze  f-  (y  -  dz, 

J\z\=^5/2 

where  (y  +  2;e)  e  G(B)  since  \!Fy-fz\  ^  <5/4  on  the  space  To  estimate  ||(y  +  2:e)“^||B, 

we  use  Dyn’kin’s  method  involving  the  S.  Bernstein  inequality,  see  [ENZ]  for  details. 

Now,  having  in  mind  applications  to  the  Beurling-Sobolev  algebras  in  Section  7,  we  com¬ 
bine  3.2.1  and  theorem  3.2.4. 


Theorem  3.2.5.  Let  A  be  a  convolution  topological  Banach  algebra  on  Z  or  on 
satisfying  conditions  (i)-(ii)  of  Subsection  3.1  and  compactly  embedded  into  B  =  mult  {A')^ 
that  is  A  Cc  B  =  mult  (A').  Let  constants  C  andE  be  defined  by  the  inequalities  ||a:  *  < 

C'll^^lUllylU  and||a;||B  ^  ^\\A\a  forallx,y  C  A,  and  let  €n{Ao,  B)  be  the  best  polynomial 
approximations  of  the  unit  ball  Aq  C  A,  and  Aq  =  be  their  distribution  function. 

Then  Si{A,  9Jt(4))  =  0,  andci(6,  A,  m{A))  ^  +  M{5)  for  all  S  >  0,  where 


4.  Spectral  Hulls  and  Norm-Controlled  Functional  Calculi 

In  the  three  preceding  sections,  we  considered  the  problem  of  uniform  upper  bounds  for 
inverses  when  staying  on  a  given  subset  X  C  1171  of  the  maximal  ideal  space  DJI.  Following 
[N4],  now  we  treat  a  more  general  form  of  the  same  problem  defining  and  studying  the  so- 
called  X-spectral  efficiency  hull  h(a,X)  of  a  given  set  a  C  C.  In  this  language,  the  uniform 
boundedness  of  inverses  is  equivalent  to  the  property  0  ^  h(as,X),  where  as  stands  for  the 
annulus  {2:  G  C  :  (5  <  <  1}.  Yet  another  reason  to  study  the  hulls  h(a,X)  is  that 

h(a,  X)  is  the  minimal  set  satisfying  the  “uniform  calculus  property”.  The  latter  means  that 
for  every  open  set  C  C  containing  h(a,  X)  there  exists  a  constant  k  =  k{Q,X)  such  that 
||/(a)||  <  ^ll/lh  for  every  /  e  Hol{Q)  and  for  every  a  e  A  with  a{a)  C  a  and  ||a||  ^  1. 
Similar  uniformly  bounded  calculi  were  implicitly  involved  in  classical  studies  of  functions 
operating  on  Fourier  algebras,  see  [HKKR],  [Rul]. 
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The  main  results  of  this  section  are  theorems  4.1.5  and  4.2,2.  Examples  of  spectral  hulls 
are  gathered  in  subsection  4.3. 

4.1.  Spectral  hulls  and  resolvent  majorants 

Let  A  be  a  unital  Banach  algebra,  and  let  X  C  Tl{A). 

Dehnition  4.1.1.  Let  a  C  D.  We  set  ^(cr;  X)  =  {a  e  A  :  ||a||  <  1,  d(X)  C  cr}, 
C(A,cr;X)  =  C{X^a;A^X)  —  sup{||(Ae  —  a)“^||  :  a  €  A{a;X)}  for  A  G  C,  where  we  take 
II (Ae  -  a)-^l|  =  oo  for  A  €  a{a).  Let  also  h(a]X)  =  h{a;A,X)  =  {A  €  C  :  C(A,a;X)  = 
oo}. 

The  set  /i (a;  X)  is  called  the  X -spectral  hull  of  a \  full  spectral  hull  of  a  is  h{cr)  — 
A,  3K(yl)).  The  complement  p{a;X)  —  C\h(a;X)  is  called  the  norm-controlled  (or 
efficient)  resolvent  complement  of  a.  For  a  positive  constant  A:  >  0,  we  also  consider  the  sets 
h[(j,k\X)  =  {A  G  C  :  C[\(j)  >  k]  andp(a,A:;X)  =  C\%,A;;X).  For  X  -  dn{A) 
we  simplify  the  notation  in  a  natural  way:  A{a)  =  A[a\VfK{A)),  h{a^  k)  =  /i(cr,  k;  9!Jt(^)), 

p{(j,k)  =  p((7,fc;9K(yl)). 

4.1.2.  First  properties.  It  is  clear  that  the  definition  of  the  (<5  —  l)-visibility  (Section  1) 
is  a  special  case  of  those  of  the  efficient  resolvent  complement.  Indeed,  let  be  an  annulus, 
08  —  {z  ^  C:  6  ^  1^:1  ^  1}.  Then  the  spectrum  of  is  (5  —  l)-visible  if  and  only  if 
0  G  p{(Ts\X),  Moreover,  ci[6,A,X)  =  C{0,as]A,X),  0  <  5  <  1. 

Note  that  n-variables  counterparts  of  A{(j\  X),  C(A,  cr;  X),  etc.,  could  be  considered  as 
well,  but  we  restrict  ourselves  to  the  case  n  =  1. 

The  following  property  of  ^(A,  cr;  A,  X)  and  h{a,  k\  A,  X)  is  a  more  or  less  straightfor¬ 
ward  consequence  of  the  definitions,  see  [N4]  for  the  proofs:  h{a;X)  ~  cr  for  every  closed 
cr  c  P  if  and  only  if  (5i  {A^X)  =  0,  where  5i  (A,  X)  is  the  critical  constant  for  the  pair  (A,  X). 

4.1.3.  Microlocalization.  It  is  clear  that  lower  estimates  for  spectral  hulls  and  resolvent 
majorants  must  depend  on  the  geometry  of  the  subset  a  C  P  under  consideration.  For  in¬ 
stance,  A(cr;  X)  =  {const}  for  every  subset  X  C  9!K(A)  equipped  with  an  analytic  structure 
and  for  every  cr  with  int  (cj)  =  0,  see  4.3.1  for  examples.  Having  in  mind  this  last  constraint, 
we  restrict  ourselves  to  the  case,  where  o  =  clos  (int  (cr)). 

In  this  case,  the  behaviour  of  (7(A,  cr;  X)  depends  on  the  following  microlocal  version  of 
the  critical  constants  and  the  inversion  majorants. 

Let  A  be  a  unital  Banach  algebra,  X  C  9!Jt(A),  and  let  0  <  5  ^  1.  A  microlocal  upper 
bound  {majorant)  for  inverses  is  defined  by 

cl{6,A,X)  =  inf{sup{||/“‘||  :  1|/||  <  1,  f{X)  C  ctj,  diam/(X)  <  e}), 

and  the  microlocal  critical  constant  is  defined  by  (5j(A,  X)  =  inf{(5  :  c5((5,  A,  X)  <  oo); 
here,  as  before,  =  {2:  G  C  :  J  <  1;2|  <  1). 

Properties  of  these  microlocal  majorants  are  similar  to  those  of  the  global  ones,  that  is,  to 
the  properties  of  Ji(A,  X)  and  ci((5,  A,  X)  defined  in  Sections  1  and  2;  see  [N4]  for  details. 
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4.1.4.  Horodisc  expansions.  Denote  D(2;,  r)  =  {(  €  C  :  -  2:|  <  r},  D{z,r)  —  {C  ^ 

C  :  IC  -  ^1  ^  In  hyperbolic  geometry  of  the  unit  disc  D,  the  discs  D(z,  1  -  |2:|),  ^  €  D 
are  called  horodiscs  .  Given  a  closed  set  cr  C  P,  we  call  the  set 

hor(CT)  =  (J(I5(2:,1  -  |2:i)) 

z^a 

the  horodisc  expansion  of  a.  In  order  to  apply  the  microlocal  version  of  the  critical  constants 
we  need  one  more  notation,  namely, 

hor(a,a)  =  (J(:D(.,^(l-|z|))). 

zea- 

Theorem  4.1.5.  Let  A  be  a  unital  splitting  Banach  algebra,  X  be  a  subset  ofM{A) 
dominating  (pe,  and  let  a  =  clos  (int  (a))  C  P.  Then 

(i)  hor  (a,  C  h{a;  A,  X)  C  hor  (a,  1/2)  =  hor  (a); 

(ii)  if  (A,  X)  =  1/2,  then  h(a;  A,  X)  =  hor  (a); 

(Hi)  ifSi(A,  X)  =  1/2  and  cj{(5,  A,  X)  =  (26  -  1)“^  (see  Section  5  for  examples),  then 
h(cr;  A,  X)  =  hor  (cr)  and  X)  —  1/dist  (A,  hor  (cr)). 

Observe  that  under  the  splitting  condition  we  always  have  (^4,  X)  ^  X)  ^  1/2 
and  hor  (cr,  (5f)  C  hor  (cr,  1/2)  =  hor  (cr).  The  equality  hor  (cr,  S^)  =  hor  (a)  holds  for  all 
a  C  P  if  and  only  if  S^{A,  X)  =  1/2.  For  examples  of  computations  and  for  pictures  of  the 
horodisc  expansions  of  various  sets  see  [N4]. 

4.2.  Spectral  hulls  and  norm-controlled  calculi 

The  Gelfand  theory  guarantees  the  existence  of  a  holomorphic  calculus  on  the  spectrum 
of  every  element  of  a  Banach  algebra  A.  Namely,  ifaGA,  the  function  of  a 

=  A  /  /(A)(Ae-o)-'dA 

Jan 

is  well  defined  for  every  /  €  Hol{Q,)  and  every  open  neighbourhood  of  the  spectrum  Q  D 
a{a)  (the  Riesz-Dunford  calculus).  As  is  shown  in  Sections  1,  2,  and  5,  this  does  not  guar¬ 
antee  any  estimate  of  the  norm  ||/(a)||,  even  for  the  simplest  functions  like  f{z)  =  1/z,  and 
even  if  we  add  the  normalizing  condition  ||a||  ^  1  to  the  spectral  inclusion  cr(a)  C  fl. 

The  true  question  is  the  following:  what  are  the  relations  between  the  spectrum  (j{a)  (or 
a  visible  part  of  the  spectrum  a(X))  and  a  domain  Q  that  guarantee  the  uniform  continuity  of 
the  n-calculus?  In  other  words,  does  there  exist  a  compact  set  K  C  Ll  and  a  constant  c  >  0 
such  that  ||/(a)||  <  c  •  sup;^  |/|  for  every  /  G  Hol(fl)  and  every  a  €  A(cr;X)  =  {a  G  A  : 
d(X)  C  cr,  ||a||  ^  1}?  We  formalize  this  setting  in  the  following  definition,  see  [N4]  for 
more  details. 
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Definition  4.2.1.  Let  A  and  X  be  as  above,  and  let  cr  =  a  c  ID.  We  say  that  an  open 
set  ri  C  C  X -dominates  the  set  cr,  or  is  a  norm-controlled  calculus  domain  for  a,  if  a  C  H 
and  the  calculi  /  i — >  /(fl),  /  €  Hol{Q)  are  well  defined  and  uniformly  continuous  for  all 
a  e  A{a;X)  —  {ae  A  \  d{X)  C  a,  ||a||  <  1}. 

In  fact,  this  property  is  equivalent  to  the  existence  of  a  compact  set  iif  c  and  a  constant 
c{K)  —  c{K,  X)  >  0  such  that 

||/(a)HcWi|/|k 

for  every  function  /  G  Hol{Q)  and  every  a  G  A{a;  X).  Here  ll/llif  =  max{|/(z)|  :  ze  K}. 

Obviously,  the  definition  of  (S  —  l)-visibility,  as  well  as  the  definition  of  the  distance 
controlled  resolvent  growth,  see  Subsection  1.1,  are  special  cases  of  the  latter  concept.  The 
following  theorem  shows  that  X-dominating  domains  for  a  given  set  a  can  be  described  in 
terms  of  the  spectral  hulls  h{a;X). 

Theorem  4.2.2.  Let  A  be  a  unital  Banach  algebra,  X  c  Wl{A),  and  let  a  C  B  and 
17  C  C  be  a  closed  and  an  open  set,  respectively.  The  following  assertions  are  equivalent 

(i)  is  an  X  -dominating  domain  for  a. 

(ii)  17  D  h{a;  X). 

(Hi)  There  exists  k  >  0  such  that  17  D  h{a,  k]X). 

4.3.  Examples  of  spectral  hulls 

Here  we  describe  examples  of  three  different  types.  As  above,  we  refer  to  [N4]  for  details. 

The  first  type  pertains  to  algebr^  A  whose  hull  operation  is  trivial  in  the  sense  that 
h{a\  9!3t(A))  =  a  for  every  cr  =  a  C  B;  see  4.3.5  below. 

For  algebras  A  of  the  second  type,  the  same  operation  a  \ — >  h{a;  S!?l(A))  is  also  trivial, 
but  in  a  different  way,  namely,  h{a;  =  B  for  every  nonempty  cr  =  a  C  B,  even  for 

singletons;  see  4.3.4  below. 

For  the  middle  type  algebras  the  full  spectral  hull  h(a\  M(A))  essentially  depends  on  a 
but  is  different  from  it.  For  instance,  this  is  the  case  for  the  measure  algebras  A  =  M{S)  on 
semigroups  considered  in  Section  5  below.  In  this  case  we  can  compute  completely  the  full 
spectral  hulls  h{a]  St)  and  the  resolvent  majorants  C(-,  <t;  S^),  see  theorem  4.3.2  below. 

4.3.1.  Spectral  hulls  for  measure  algebras  on  semigroups.  Anticipating  the  systematic 
study  of  measure  algebras  (see  Section  5  below),  here  we  apply  the  above  theory  to  compute 
relevant  spectral  hulls.  Let  M  (S)  be  the  convolution  algebra  of  measures  on  a  sub-semigroup 
of  a  locally  compact  abelian  group  G,  and  Sf,  be  the  set  of  all  bounded  semi-characters  on  S 
which  we  consider  as  the  visible  part  of  the  spectrum  of  M  (S)  (see  Section  5  for  details). 
The  corresponding  Gelfand  (Fourier-Laplace)  transformation  is  denoted  by  T.  For  example, 
the  analytic  Wiener  algebra  A  =  TMifLjf)  =  Tl\(ffL^  =  W+  corresponds  to  5  =  Z.,. 
and  Sb  =  9?l(A)  =  B,  with  /  i — >  f{z),z  G  B,  as  the  Gelfand  transformation.  Since  a 
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nonconstant  holomorphic  function  is  an  open  mapping,  the  sets  A{a)  =  v4(cr,  D)  =  {/  e 
W+  :  ll/ll  ^  1,  /(P)  C  a}  and  A{(j'),  where  a'  —  clos  (int  (cj)),  differ  from  each  other  by 
constant  functions  taking  values  in  a\a'.  Hence,  we  can  restrict  ourselves  to  the  case,  where 
cr  —  cr'.  Supposing  a  =  we  can  easily  deduce  from  theorem  4.1.5  a  description  of  h[a\  S^) 
for  A^(5),  as  well  as  for  all  subalgebras  of  M{S)  considered  in  Section  5. 

Theorem  4.3.2.  [N4]  Let  A  =  or  let  A  be  any  algebra  A  c  M(S)  sat¬ 

isfying  the  conditions  of  theorem  4.3.4  below.  Let  a  be  a  closed  subset  of  D  such  that 
o  =  clos  (int  (a)).  Then 

(i) h{a;Sb)  =  hor(t7); 

(ii)  C(A,  a;  Sb)  =  1/dist  (A,  hor  (a))  for  A  G  C; 

(Hi)  An  open  set  Q  is  a  norm-controlled  calculus  domain  fora  if  and  only  ifQ  D  hor  (a). 

Indeed,  assertion  (Hi)  is  a  straightforward  consequence  of  (i)  and  theorem  4.2.2.  Asser¬ 
tions  (i)  and  (ii)  are  special  cases  of  theorem  4.1.5,  because  (A,  -§6)  =  1/2  and  (pdf)  = 
/(O)  G  f{Sb)  for  every  /  G  M{S).  | 

4.3.3.  Full  spectral  hulls  equal  to  D.  Here  we  describe,  following  [N4],  axlass  of  “bad” 
Banach  algebras  for  which  the  full  spectral  hull  of  every  spectrum  is  equal  to  D.  To  this  end, 
we  need  a  bit  of  model  operators.  All  properties  claimed  below  can  be  found  in  [Nl]. 

Let  0  be  a  singular  inner  function  in  P,  and  let 

©  =  0/x  =  exp{  f  dffiO),  2;  G  P, 

f  T  ^  ~  S 

be  its  canonical  integral  representation,  where  p  is  a  positive  measure  on  T  singular  with 
respect  to  the  Lebesgue  measure  m.  Further,  let  Kq  —  be  the  orthogonal  comple¬ 

ment  of  the  corresponding  ^-invariant  subspace  0if^  of  the  Hardy  space  The  compres¬ 
sion 

f^Mef  =  Pezf,  feKe. 

of  the  multiplication  operator  is  called  a  model  operator;  Pq  stands  for  the  orthogonal  pro¬ 
jection  to  Kq.  The  Sz.-Nagy-Foias  model  theory  tells  that  every  completely  non  unitary 
contraction  T  with  rank(l  -  T*T)  ~  1  and  a(T)  P  is  unitarily  equivalent  to  an  operator 
Mg,  see  [SzNF]. 

Now,  we  define  the  algebra  A  =  Ae  as  the  norm  closure  of  polynomials  in  Mq.  It 
is  an  exercise  to  show  that  always  ^)yl{Ae)  —  a{Me)  =  snpp{p)  C  T  and  ||Me||  =  1. 
It  is  worth  mentioning  that,  for  0  =  0^,  where  p  =  6i,  there  exists  a  unitary  operator 
U  :  Kq  — )■  L2(0,  1)  such  that  the  algebra  UAqU~^  is  the  norm  closure  of  polynomials  in 
the  integration  operator  J f{x)  =  f{t)  dt,  x  G  (0, 1),  on  the  space  L^(0, 1). 

Theorem  4.3.4.  [N4]  Let  A  =  Aq  be  the  above  Banach  algebra  corresponding  to  a 
singular  inner  function  0  =  0^  with  the  representing  measure  p  whose  closed  support  is  a 
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setofzeroLebesgueme^ure,  m(supp  (/i))  =  0.  Then  for  every  closed  subset  a  C  D,  a  ^  0, 
we  have  h{a]  Tt{A))  =  D. 

4.3.5.  Spectrally  closed  sets.  It  is  natural  to  call  a  subset  a  C  C  (A,  X)-spectrally  closed 
(or  A-spectrally  closed  in  the  case_where  X  =  dfl{A))\fh{a\  A,  X)  —  a.  Property  4.1.2  tells 
us  that  every  closed  subset  cr  C  D  is  (A,  X)-closed  if  and  only  if  (5i(A,  X)  ~  0.  In  Section 
7  we  give  an  account  of  known  results  about  weighted  Beurling-Sobolev  algebras  satisfying 
this  property. 


Part  II.  Convolution  Algebras 

5.  Unweighted  Convolution  Algebras  on  Groups  and  Semigroups 

Let  G  be  an  LCA  group  and  M{G)  be  the  convolution  algebra  of  all  complex  Borel 
measures  on  G  endowed  with  the  standard  variation  norm  ||/x||  Var{/^).  The  Fourier 

transforms  Tp  —  p, 


=  /i(7)  =  f  {-X,  7)  dn{x),  7  e  G, 

Jg 

form  an  algebra  of  functions  on  the  dual  group  G.  We  denote  it  by  TM{G)  and  endow  it 
with  the  range  norm  \\J^p\\  =  ||/i||.  Clearly,  T{p  *  u)  =  {J^p)  •  {Tu)  for  all  //,  1/  6  M{G), 
and  so  the  Banach  algebras  A4(G)  and  TM{G)  are  isometricily  isomorphic.  The  term  “the 
measure  algebra  of  G”  will  be  referred  to  the  both. 

Since  the  mapping  p  \ — y  p{'y)  is  a  complex  homeomorphism  of  M{G),  we  can  injec¬ 
tively  embed  G  into  and  regard  G  as  the  visible  spectrum  of  Ai{G)  in  the  sense 

of  the  Introduction.  In  particular,  dos  p{G)  C  g[p)  for  any  p  e  M{G).  The  Wiener-Pitt- 
Sreider  theorem  implies  (see  the  Introduction)  that  f)Jfl{M{G))  =  clos  G  if  and  only  if  G  is  a 
discrete  group;  in  fact,  in  this  case  we  simply  have  dJll{M{G))  =  G. 

However,  even  in  the  latter  case,  the  problem  of  the  norm-controlled  inversion,  as  de¬ 
scribed  in  previous  sections,  is  still  of  interest.  Moreover,  from  the  quantitative  point  of 
view,  we  cannot  distinguish  any  advantage  of  discrete  groups,  for  which  the  spectrum  G  = 
^ffi{M{G))  is  completely  visible  (in  the  sense  of  Definition  1.2.1),  as  compared  with  the  gen¬ 
eral  LCA  groups,  for  which  X  ^  G  ^  dyi{M{G)),  and  the  spectrum  is  even  1-invisible. 
This  is  an  essential  distinction  between  the  concepts  of  n-visibility  (without  any  norm  con¬ 
trol)  and  {6  ~  n)- visibility.  Speaking  informally,  the  1-visibility  of  f)Jl{M{G))  for  discrete 
groups,  guaranteed  by  the  classical  Wiener-Levy  theorem,  is  illusory  because  it  does  not 
endure  quantitative  specifications  by  (S  -  l)-estimates  of  inverses. 
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5.1.  An  upper  estimate  for  the  measure  algebra  on  a  group 

The  constants  Cn{6,M{G),  G),  defined  in  1.2.3,  can  be  estimated  by  using  theorem  2.2.7, 
because  the  algebra  M[G)  is  G-symmetric  in  the  sense  of  2.2.1.  The  main  condition  (Hi) 
of  theorem  2.2.7  can  be  checked  with  the  help  of  lemma  2.2.6.  Namely,  one  can  prove  that 
(fe  €  conv  {S^  e  G)  by  means  of  “triangular”  positive  semidefinite  kernels  on  G,  such 
that  ka{y)  >  0  for  7  e  G,  0  <  ^0(3;)  ^  1  =  fea(O)  for  x  eG,  and  kaix)  =  0  outside  of  a 
neighborhood  14  of  the  origin  such  that  fja  ^4  =  {0}-  Indeed,  setting 

Jg 

v/c  have  (fa  ^  conv  :  7  G  G)  and  lima  ^ain)  =  fc  dp  =  p{{0})  ^  (pe{p)  for  all 
pe  M{G).  This  proves  the  following  theorem. 

Theorem  5.1.1.  For  every  LC A  group  G,  the  spectrum  of  M{G)  is  completely  6 -visible 
for  every  6  satisfying  ^  ^  consequently,  <5„(A^(G),G)  ^  V2  for  all  n  ^  1. 

Moreover, 

(5.1)  c„(d,Ad(G),G)<^^ 

for  1  /y/2  <6^1  and  for  alln^  1. 

5.2.  The  measure  algebra  on  a  semigroup 

Here  we  consider  the  convolution  measure  algebras  jV((S)  on  semigroups  5.  Since  the 
language  of  the  semigroup  theory  is  not  canonically  fixed,  we  define  exactly  which  objects 
we  are  dealing  with.  For  general  facts  of  harmonic  analysis  on  semigroups  we  refer  to  [T]. 

5.2.1,  Definitions.  By  a  semigroup  S  we  mean  the  following. 

(i)  S  is  a  Borel  subset  of  a  LCA  group  G  such  that  x,y  €  S  =>  x  +  y  e  S, 

(ii)  0  G  5. 

A  bounded  character  (also  called  semicharacter)  on  5  is  a  bounded  continuous  function 
7  :  S  — ^  C  such  that  7(0)  1  and  7(0:  y)  =  l{x)f{y)  for  all  x,y  e  S.  It  is  clear  that 

every  such  function  is  bounded  by  1:  l7(a:)|  <  1,  a;  G  S'.  The  set  of  all  bounded  characters 
of  5  is  denoted  by  S^.  Obviously,  G  C  56,  in  the  sense  that  the  restriction  7|5  of  a  character 
7  G  G  is  a  bounded  character  of  5.  In  what  follows,  we  assume  that  the  following  separation 
property  holds. 

(fi7)  For  every  a:  €  S,x  ^  t),  there  exists  a  bounded  character  y  G  St  such  that  \f{x)\  <  1. 

In  particular,  5  D  (-5)  =  {0}  if  a  semigroup  5  satisfies  condition  (Hi).  Let  M[S)  = 
[p  G  A^(G)  :  p\{G\S)  =  0}  be  the  subspace  of  A^(G)  consisting  of  all  measures  supported 
by  5.  An  immediate  verification  shows  that  Ad (5)  is  a  (closed)  subalgebra  of  M{G).  Now, 
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we  define  the  Fourier-Laplace  transformation  on  Sb  setting  Cp{'y)  =  jl{f)  =  'y(x)  d/2(x) 

for  p  e  M(S)  and  7  G  Sf,.  Clearly,  the  functional 

(5.2)  /2GM(S), 

is  a  norm  continuous  homomorphism  of  the  algebra  M(S)  for  every  j  E  Si,.  Moreover, 
jCp('y)  =  7)  for  all  j  E  G.  Hence,  it  is  natural  to  consider  the  space  of  bounded 

characters  Si,  as  the  visible  part  of  SPT(A^(S')),  and  write  Si,  C  97l{Af  (iS)). 

The  following  theorem  can  be  proved  in  a  similar  way  to  5. 1 . 1 .  However,  the  result  is  two 
times  better  as  compared  with  the  algebras  M[G). 

Theorem  5.2.2.  Let  S  be  a  semigroup  satisfying  conditions  (i)-( Hi).  Then, 

(5.3) 
and 

(5.4)  Ci(5,A4(5),Si)^  2^ 
for  all  5,  ^  <  6  ^  1. 

5.3.  Examples  and  comments 

5.3.1.  Symmetric  and  analytic  Wiener  algebras.  Let  G  =  Z  be  the  additive  group  of 

integers,  and  let  5  —  =  {n  G  Z  :  n  ^  0}.  Then  A^(Z)  =  /^(Z),  >f(Z+)  = 

and  W  =  FM{Z)  —  {p.  ==  C  1^{Z)}  is  the  Wiener  algebra  of  absolutely 

converging  Fourier  series  on  the  circle  group  Z  =  T  =  {(gC:|C|  =  1}.  The  bounded 
characters  of  Z+  fill  in  the  closed  unit  disc  55  =  D={;2:GC:  |^|  ^  1}.  The  corresponding 
Fourier-Laplace  transformation  is  p  \ — >  p{z)  —  “  LM(Z^)  = 

{p  =  p{n)z^  :  p  E  1^{Z+)}  is  the  analytic  Wiener  algebra  on  D.  Preceding  theorems 
tell  that  whWw  ^  (25^  -  1)“'  if  /  6  W  and  1/v^  <  6  <  |/(C)1  <  ||/||w'  <  1  for  |C|  =  1, 
and  l|/-i||  <  {25  -  l)-i  if  /  €  and  l/x/2  <  <5  ^ \f{z)\  ^  \\f\\w^  ^  1  for  |^1  <  1. 

5.3.2.  Several  variables,  continuous  versions,  and  cones  in  Z^  and  The  same 
estimates  of  inverses  as  in  5.3.1  hold  for  the  multivariate  Wiener  algebra  on  the  torus  T^, 

W  =  FM(Z^)  =  {p=  p(n)C  :  li  € 

— N 

for  the  analytic  Wiener  algebra  on  the  polydisc  ID)  , 

=  CM{Z^)  =  {A  =  ^  n(n)z’^  :  //  6  /'(Z^)}, 

nez^ 
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and  for  continuous  versions  of  the  Wiener  algebras.  The  latter  correspond  ioG  —  R,S  — 
IR4.  —  {a;  G  R  :  2:  ^  0},  and — in  several  variables — to  G  =  5  =  .  Here  G  =  R, 

5{,  =  C+  =  {2:  G  C  :  Im  (2;)  ^  0}  -  the  closed  upper  half-plane,  and,  for  several  variables, 
G  =  5^  =  C+ . 

Instead  of  the  cones  R^  C  R^  and  -  R^^  n  Z^,  we  may  consider  an  arbitrary 
subsemigroup  S  of  R^  containing  0  and  such  that  5\{0}  is  contained  in  an  open  halfspace 
(this  requirement  guarantees  the  separation  property  (in)  of  Subsection  5.2.1).  Usually,  S  is 
a  convex  cone  satisfying  the  latter  property,  or,  in  its  discrete  version,  the  intersection  of  such 
a  cone  with  Z^.  For  the  continuous  version,  the  semicharacters  are  x  1 — >  with  z  G  S^, 

where  ^6  =  {.s  G  :  Im  (a;  *  2)  ^  0  for  all  a;  G  5}  and  a:  •  2:  =  XkZk.  The  Fourier- 
Laplace  transformation  is  again  the  classical  one.  For  instance,  taking  S  =  {(a;i,  2:2)  G  R^  : 
Xi  ^  0,  -kxi  ^  2:2  ^  kxi},  where  /:  >  0,  we  have  St  =  {(21,^2)  €  :  k\\in  (22)!  ^ 

Im(2i)}. 

Another  example  is  the  halfspace  S  =  {(a;i,a:2)  G  R^  :  Xi  >  0}u{0},  with  the  corre¬ 
sponding  dual  set  of  characters  Sb  ~  {(zi,Z2)  G  C?  :  ^(2:2)  =0,  Im  (21)  ^  0}  —  C+  x  R. 

5.4.  Some  subalgebras  ofM.(G)  and  M{S) 

Here  we  briefly  consider  two  classical  subalgebras  of  A^(G)  (or  At (5)),  namely,  the 
algebras  of  absolutely  continuous,  respectively,  discrete  measures  on  G  (or  on  S). 

5.4.1.  The  group  algebra  L^{G),  Let  G  be  an  LCA  group,  and  L^{G)  the  convolution 
group  algebra  on  G,  which  becomes  a  unital  algebra  if  we  adjoin  the  Dirac  point  mass  at  the 
origin  e  =  (if  G  is  not  discrete).  Since  the  Riemann-Lebesgue  lemma  implies  that  Peiff)  = 
A  =  lim.y__^oo  Kf)  for  p  =  Xe-f  fdm  G  (L^(G)  +  Ce),  we  can  directly  apply  lemma  2.2.3 
and  obtain  the  following  improvement  of  theorem  5.1.1:  for  any  nondiscrete  LCA  group  G, 
one  has  6i{L^[G)  +  Ce,G)  ^  1/2;  moreover,  Ci((^,  L^{G)  +  Ce,  G)  ^  (2(^-1)"^  for  all 
1/2  <  5  ^  1.  (For  n  ^  2  we  still  have  estimates  (5.1)).  In  Subsection  5.5  we  show  that  this 
new  estimate  for  V- (G)  +  Ce  is  sharp. 

5.4.2.  The  algebra  of  almost  periodic  functions  TM d{G),  and  the  algebra  of  Dirich- 
let  series  CMd{S).  Let  Md(G)  be  the  algebra  of  discrete  measures  on  an  LCA  group  G,  and 
Md{S)  be  its  subalgebra  of  measures  supported  by  a  subsemigroup  S  satisfying  hypotheses 
(i)-(iii)  of  Subsection  5.2.1. 

The  algebra  TMd{G)  of  Fourier  transforms  of  discrete  measures  is  the  algebra  of  almost 
periodic  functions  with  absolutely  convergent  Fourier  series.  Theorem  5.1.1  works  for  this 
algebra  too  (the  visible  spectrum  is,  of  course,  X  =  G).  As  for  the  entire  algebra  A1(G), 
we  will  see  in  Subsection  5.5  below  that  6i{XMd{G),G)  ^  1/2.  It  should  be  mentioned 
that,  as  before,  impossibility  of  the  norm  control  of  inverses  for  small  J,  i.e.,  the  fact  that 
ci((^,  Md{G),G)  =  00  for  0  <  (5  ^  1/2,  is  not  related  to  the  evident  fact  that  the  spectrum 
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~  (G)  (the  Bohr  compactum)  is  much  larger  than  X  —  G.  Indeed,  we  show 
that  even  staying  on  the  Bohr  compactum  we  still  have  ^  1/2,  see  below. 

Similarly,  J^Md{S)  is  the  algebra  of  absolutely  convergent  Dirichlet  series 

fh)  =  ^ 

x^S 

with  J2x€S  I/^({^})I  <  classical  Dirichlet  series  corresponds  to  the  case 

S  —  R+,  Sb  =  {z  e  C  :  Re{z)  ^  0}  with  the  pairing  (x,  z)  t — >  e~^^.  Like  Md{G),  the 
algebra  A^d(5')  is  an  inversion  stable  subalgebra  of  Al(5).  Hence,  Theorem  5.2.2  is  still 
valid  for  this  algebra  as  well. 

5.5.  A  lower  estimate  for  the  measure  algebras 

In  this  section,  we  show  how  to  get  a  common  lower  estimate  of  Ci((5,  A,  X)  for  the 
measure  algebras  of  groups  and  semigroups,  and  for  their  subalgebras  considered  above.  In 
particular,  we  show  that  the  critical  constant  6i  is  greater  than  or  equal  to  \  for  every  infinite 
LCA  group  and  for  the  most  part  of  semigroups. 

In  fact,  we  dispose  two  approaches  to  lower  estimates.  The  first  one  is  based  directly  on 
the  existence  and  properties  of  measures  carried  by  independent  Cantor  sets;  in  what  follows 
a  special  kind  of  such  measures  (Sreider  measures)  is  used.  The  corresponding  theory  is 
surely  one  of  the  most  subtle  chapters  of  measure  algebra  theory,  see  [GRS],  [Rul],  [GMG]. 
The  second  method  is  inspired  by  H.  Shapiro’s  example  [Shi].  In  our  setting,  it  depends  on 
some  improvements  of  the  technique  of  exponentials  norm  behaviour  for  t  >  0. 

This  technique  is  well  known  to  be  the  ground  of  the  theory  of  functions  operating  on  an 
algebra,  see  [Rul],  [GMG].  The  first  way  is  faster,  and,  following  [N4],  we  use  it  in  this 
section.  For  the  second  one,  more  explicit,  we  refer  to  [ENZ]  and  [Sh2]. 

We  start  by  recalling  some  classical  facts  on  measure  algebras. 

5.5.1.  Sreider  measures.  Let  be  a  nondiscrete  LCA  group.  It  is  known  that  there 
exists  a  positive  measure  fi  e  M{G)  such  that  fl{G)  C  [—1,1]  and  =  D,  \\fi\\  =  1; 
see  Sreider  [Sr]  for  G  =  R,  and  [Rul],  Theorem  5.3.4,  for  the  general  case  and  history.  We 
call  such  /i’s  Sreider  measures.  Moreover,  it  is  known  that  there  exist  continuous  Sreider 
measures  ^  such  that  the  convolution  powers  and  their  translates,  (5a,  *  /x*,  are  all  mutually 
singular,  see  [Rul],  [GMG].  It  is  worth  mentioning  that  0  6  jji{G)  for  all  known  examples  of 
such  measures. 

5.5.2.  Bohr  compactiiication.  For  an  LCA  group  G,  we  denote  by  Gd  the  dual  group 
G  endowed  with  the  discrete  topology,  and  set  G  =  (Gd)"-  The  compact  group  G  is  called 
the  Bohr  compactification  of  G;  in  fact,  G  is  a  dense  subset  of  G,  and  G  =  9Jt(A^{Gd))  == 
^{Md[G)). 
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5.53.  How  to  get  lower  estimates  on  groups.  Here  we  explain  a  method  to  prove  lower 
estimates  for  critical  constants  and  norm-controlling  constants,  making  use  of  Sreider  mea¬ 
sures  and  Bohr  compactification.  One  can  call  the  method  “a  walk  to  the  Bohr  compactum”. 
In  particular,  we  get  the  needed  estimates  for  all  three  algebras  on  groups  we  considered 
above,  namely,  for  M(G),  L^{G)  -f  C  *  and  Md{G). 

The  method  consists  of  the  following  steps  (see  [N4]  for  more  details): 

1)  staying  on  an  infinite  group  G,  we  lift  ourselves  up  to  the  Bohr  group  G  D  G; 

2)  being  nondiscrete,  G  carries  a  Sreider  measure,  say  i>,  whose  polynomials  give  the 
required  lower  estimate,  see  Lemma  5.5.4  below; 

3)  using  almost  periodic  approximations  of  Tiy,  we  obtain  the  same  lower  estimate  on 
G  (via  the  classical  Bochner  criterion  for  the  membership  in  TM{G),  see  [Rul],  Theorem 
1.9.1). 

Of  course,  for  a  nondiscrete  group  G,  steps  1)  and  3)  are  not  necessary.  To  accomplish 
step  3)  for  general  G,  we  need  a  kind  of  a  strengthened  weak  topology,  precisely,  a  version 
of  Beurling’s  narrow  topology.  Namely,  we  say  that  a  net  {pi)iei  C  M(G)  is  G-convergent 
to  a  measure  n  G  M{G)  if  limig;||//i||  ^  ||//||  and  =  fL{f)  for  7  e  6.  A  set 

A  C  M{G)  is  said  to  be  G-dense  in  a  set  5  c  M[G)  if  every  /i  G  B  is  a  (5-limit  of  a  net  of 
measures  belonging  to  A. 

The  following  lemma  is  a  straightforward  consequence  of  the  spectral  mapping  theorem. 

Lemma  5.5.4.  Let  be  a  Sreider  measure  on  a  nondiscrete  LCA  group  G,  and  let  p  = 
(^e  +  (1  -  (5)z/^,  where  1/2  <6^1,  and e  —  60  stands  for  the  unit  ofM{G).  Then  ||/i||  =  1, 
A(G)c[<5,l]and||M-‘|l  =  (25-l)-'. 

Theorem  5.5.5.  Let  G  be  an  infinite  LCA  group,  and  let  A  be  a  unital  subalgebra  of 
M(G)  (not  necessarily  closed)  satisfying  the  following  conditions:  (i)  A  is  G -symmetric; 
(ii)  A  is  G-dense  in  M(G).  Further,  given  a  number  S,  1/2  <  S  ^  1,  let  A{[S,  1])  —  {a  e 
A:  ||a||  ^  1,  a{G)  C  [5,1]}.  Then 

sup  ||g!“^||  ^  {26  —  1)"^ 

aeA([<y,l]) 

In  particular,  (^i(A,  G)  ^  1/2,  and  ci{S,  A,  G)  ^  {25  -  1)"^  for  ^  <  5  ^  1. 

The  algebras  A  —  M{G),  A  =  (G)  +  C*  e,  and  A  =  Md{G)  satisfy  conditions  (i)  and 

(ii),  and  hence  the  conclusions  hold  for  these  algebras. 

5.5.6.  How  to  get  a  lower  estimate  on  semigroups.  Here  we  describe  a  semigroup 
counterpart  of  theorem  5.5.5  by  using  the  same  method  of  moving  to  the  Bohr  compactum. 
However,  the  construction  of  measures  giving  the  maximum  to  \\p~^\\  when  inf  |/i|  is  fixed 
is  necessarily  different.  Indeed,  the  previous  construction  is  based  on  the  G—  symmetry  of 
M{G)  and  on  positive  semidefiniteness  of  the  corresponding  Fourier  transforms.  Both  rea¬ 
sons  fail  for  semigroups.  Instead,  we  are  using  a  more  complicated,  but  general  construction 
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of  theorem  2.3.1.  By  the  way,  this  gives  another  proof  to  theorem  5.5.5;  namely,  theorem 
2.3.1  and  lemma  5.5.7  below  imply  5.5.5. 

In  order  to  realize  the  technique  of  narrow  approximations  on  a  semigroup  instead  of  the 
entire  group,  we  restrict  ourselves  to  a  class  of  semigroups  S  described  in  Subsection  5.2.1. 
This  class  contains  all  frequently  used  semigroups,  in  particular,  all  examples  of  Subsection 
5.3  above. 

Until  the  end  of  this  Section,  we  denote  by  ^  a  subalgebra  of  M{G)^  and  by  A  a  subal¬ 
gebra  of  M{S). 

Lemma  5.5.7.  Let  G  be  an  infinite  LCA  group,  and  let  A  be  a  subalgebra  of  M{G) 
which  is  G-symmetric  and  G -dense  in  M(G)  (but  not  necessarily  closed  and/or  unital). 
Then,  A  satisfies  (2.3)  with  X  =  G. 

Now,  we  describe  a  class  of  semigroups  S  and  a  class  of  subalgebras  A  c  M(S)  where 
the  method  based  on  theorem  2.3.1  works;  see  [N4]  for  more  details. 

5.5.8.  Absorbing  semigroups  and  S'-subalgebras.  Let  5  be  a  semigroup  embedded  into 
an  LCA  group  G,  S  C  G,  and  satisfying  hypotheses  (i)-(in)  of  Subsection  5.5.1.  We  say  that 
G  satisfies  the  absorbtion  condition  if  the  following  is  true. 

(iv)  For  every  compact  set  K  C  G  there  exists  an  element  x  e  G  such  that  x  K  C  S. 

If  S  is  absorbing,  there  exists  an  element  x  e  S  such  that  x  +  K  C  S  (consider  if  U  {0}), 
and,  therefore,  ( iv)  implies  that  S  generates  G  in  the  sense  G  =  S-S.  On  the  other  hand,  if  S 
is  generating  and  <S'\{0}  is  open,  or  S  contains  an  open  generating  part,  then  S  is  absorbing. 
For  instance,  this  is  the  case  for  all  examples  of  Subsection  5.3. 

Now,  we  describe  the  class  of  subalgebras  of  M{S)  we  are  working  with.  Namely,  let  S 
be  a  semigroup,  S  C  G.  We  say  that  A  is  an  S-subalgebra  of  M{S)  if  A  is  a  subalgebra 
ofM{S)  containing  a  “smalU  subalgebra  of  the  form  PsA  +  Ce,  where  A  C  M{G)  stands 
for  a  subalgebra  of  M{G)  verifying  the  following  conditions  (compare  with  the  conditions 
of  theorem  5.5.5): 

(i)  A  is  G-symmetric; 

(ii)  A  is  G— dense  in  M{G); 

(Hi)  A  is  5-invariant,  that  is,  *  A  C  A  for  a;  G  5; 

(iv)  G  is  a  boundary  for  9PT(A),  that  is,  lim„  =  sup^^  \p\  for  every  p  e  A. 

Observe  that  A  is  not  assumed  to  be  unital.  The  standard  subalgebras  A  =  Md{S) 
and  A  =  L^(S)  C-e,  with  obvious  group  counterparts  A  =  Md{G)  and  A  —  L^(G), 
respectively,  are  5-subalgebras  of  A^(5).  Hence,  the  same  is  true  of  any  bigger  subalgebra. 
For  instance,  Md{S)  +  L^(5),  and  Af  (5)  itself,  are  5-subalgebras.  Another  example  is  the 
algebra  M /  (5)  of  finitely  supported  measures  on  5. 

Now,  we  are  ready  to  state  the  following  lower  estimate  for  semigroups. 
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Theorem  5.5.9.  LetS  be  a  semigroup  satisfying  conditions  (i)-( Hi)  of  Subsection  5. 2 A 
and  the  absorbtion  condition  (iv).  Let  A  be  an  S -subalgebra  of  M{S)  (not  necessarily 
closed).  Then6i{A,Sb)  >  1/2  and  Ci{S,  A,  St)  ^  (2(^  -  1)"^  forallS,  1/2<S<  1. 

For  the  proof,  we  check  (2.3)  with  X  —  Sf,,  see  [N4],  theorem  3.3.7. 

6.  Some  Finite  Groups  and  Semigroups 

In  this  section  we  briefly  consider  the  measure  algebras  M  on  finite  groups  and  on  finite 
semigroups.  In  general,  exact  computations  of  the  majorants  Cn(S,M,  X)  for  these  groups 
and  semigroups  are,  probably,  even  more  complicated  than  in  the  infinite  case.  This  is  why 
we  mainly  restrict  ourselves  to  two  examples:  to  cyclic  groups  Cd  =  Jj/dZ  of  order  d  ^  1, 
to  nilpotent  semigroups  Zd  =  +  Z+)  of  order  d.  In  a  sense,  the  groups  Cd  “exhaust” 

the  group  Z  and  the  semigroups  Zd  “exhaust”  Z+.  Essentially,  we  consider  the  asymptotics 
of  Cn(5,M{Cd),  Cd)  and  Cn{S,M{Zd),  Zd)  as  c?  — ^  oo. 

6.1.  Finite  groups 

6.1.1.  Preliminaries.  Let  G  be  a  finite  group  written  additively,  and  me  the  invari¬ 
ant  (Haar)  measure  normalized  by  =  1  for  every  x  €  G.  Clearly,  the  space 

M{G)  =  L^{G)  =  l^(G)  is  a  convolution  Banach  algebra  with  the  unit  e  =  ^o-  All  complex 
homomorphisms  are  given  by  the  Fourier  (Gelfand)  transformation,  /  i — >  =  f(f)  ~ 

'^xeG  7),  7  ^  6,  where  G  is  the  dual  group  of  unimodular  characters  written  mul- 

tiplicatively.  Therefore,  dJl(M{G))  =  G.  The  Haar  measure  m  =  is  normalized  to 
have  total  mass  1,  so  that  the  Fourier  transformation  7^  is  a  unitary  operator  from  L^(G)  to 
L^(G).  It  is  easy  to  see  that  we  can  regard  the  dual  group  G  as  the  visible  spectrum  for  any 
convolution  algebra  on  G.  That  is,  in  our  previous  notation,  we  set  X  =  G. 

Using  equivalence  of  every  two  norms  on  a  finite  dimensional  vector  space,  one  can  show 
that  for  finite  groups  the  majorants  Ci((5,  A,  G)  always  have  the  linear  growth  rate  as  S  — >  0. 
For  example,  ||m“^||  ^  k{A)/5  for  every  p  e  A  satisfying  \fi(f)\  ^  S  for  all  7  G  G. 
Here  k{A)  is  a  constant  depending  only  on  a  convolution  algebra  A  on  a  finite  group  G,  and 
k{M{G))  <  (card  (G))i/2. 

We  can  say  more  for  the  special  case  of  cyclic  groups  Gd- 

6.2.  The  cyclic  group  Gd 

Let  G  =  Cd  =  ZjdZ—  {0, 1, . . . ,  rf  -  1}  be  the  cyclic  group  endowed  with  the  quotient 
composition.  The  dual  group  Cd  is  the  group  of  d-th  roots  of  unity  Gd  —  {Cfc  —  •  0  ^ 

k  ^  d  -  1},  where  (  =  ((d)  =  e27rz/d  Fourier  transform  of  an  element  /  G  M{Cd) 

is  Ff  iCk)  =  fiCk)  =  Ilo^s<df{^)Q>  (k  e  Cd,  and  the  norm  is  ||/||  =  T.o^s<d  l/(5)l-  Let 
Ck  =  X{k]  be  the  basic  functions  on  Gd.  The  convolution  on  Gd,  which  we  denote  by  o, 
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follows  the  rule  Cr  o  Cs  =  where  t  e  Cd^t  =  (r-\-  s)mod  (d).  Since  we  cannot  compute 
Cn{S,  M{Cd),Cd),  we  consider  the  behaviour  of  the  upper  bound  c„(5,  {(7^})  defined  by 

(6.1)  Cn{S,  {Cd})  ^  sup{c„(J,  Ad(C'd),  Crf)  :  d  ^  1},  0<S  ^1. 

The  following  theorem  shows  that,  in  a  sense,  the  algebras  M{Cd)  approximate  the  algebra 
M{Z)  =  l\Z)  asd^oo. 

Theorem  6.2.1.  (i)  Cn{S,M{Cd),Cd)  ^  d^/^  •  min{(5~^,(5"^n^/^)  for  all  0  <  d  ^  1  and 
n  ^  1. 

(ii)  Cn(S,M(Cd)y  Cd)  ^  (2(5^  -  1)~^  for  all  1/v^  <  d  ^  1  andn  ^  1. 

(Hi)  Cn(S,  {Cd})  ^  Cn{5,M{Z),l!)  for  allO  <  S  ^  1,  where  c„(d,  {Cd})  is  defined  in 

(6.1) .  In  particular,  Ci(d,  {Cd})  =  oo  for  0  <  6  ^  1/2,  and  ci(d,  {Cd})  >  (2d  ~  1)“^  for 
1/2  <  d  <  1. 

6.3.  The  nilpotent  semigroup  Zd 

The  semigroup  Zd  =  Z+/(d+Z+)  is  defined  as  the  set  Zd  —  {0, 1, . . . ,  d- 1,  d}  endowed 
with  the  operation  (s,  t)  i — >  min(s  +  t,d),  and  with  the  measure  m({s})  =  1  for  0  <  s  <  d 
and  m({d})  =  0.  Therefore,  on  the  basic  functions  e^,  es(t)  =  Ss,t  (the  Kronecker  delta),  the 
convolution  is  defined  by  the  formula 

Cs  o  Ci  =  Cs+t  for  s  + 1  <  d,  and  o  e*  =  0  for  s  +  t'^  d. 

The  space  M{Zd)  =  L^(Zd,  m)  of  all  measures  (functions)  on  Zd  endowed  with  this  convo¬ 
lution  and  with  the  usual  norm  is  a  unital  d-nilpotent  Banach  algebra.  Namely,  the  algebra 
A^(Zd)  has  a  generator  ei  such  that  ef  —  0.  Hence,  the  only  character  on  Zd  is  the  trivial 
one:  0  i — >  1  and  s  i — >  0  for  5  >  0.  We  write  Zd  =  {0},  and  ^{M{Zd))  =  {0}  with  the 
only  homomorphism  on  A^(Zd),  namely  p  —  ip(s))o^s«i ' — >  /^(O)- 

Theorem  6.3.1  below  gives  the  exact  value  of  the  majorant  ci(d,  Ad(Zd),  {0})  and  shows 
that  the  algebra  Ad(Z+)  =  /^(Z+)  is,  in  a  sense,  the  limit  of  M{Zd)  as  d  — >  oo. 

Theorem  6.3.1.  For  all  0  <  5  ^  1,  ci(d,  M(Zd),  {0})  =  d"^ 
therefore,  Ci(d,  Ad(Zd),  {0})  ~  d"^^  as  S  — y  0.  Moreover, 

ci(d,{Zd})  =:  sup{ci(d,Ad(Zd),{0})  :  d  ^  1}  =  ci(d,  yW(Z+),5) 

for  all  0  <  d  <  1,  that  is,  ci(d,  {Zd})  =  oo  forO  <  d  <  1/2,  andci(S,  {Zd})  =  (2d  -  1)”^ 
forl/2<d<l. 

7.  The  Weighted  Beurling-Soholev  Algebras 

In  this  section,  we  present  some  results  on  estimates  of  the  inverses  in  Beurling-Sobolev 
algebras  contained  in  [ENZ].  The  principal  conclusion  is  that  the  spectrum  971(A)  of  a  “suffi¬ 
ciently  smooth”  Beurling-Sobolev  algebra  A  =  P{Z,  w),  as  well  as  of  its  analytic  counterpart 
A  =  P{Z+,  w),  is  (d  -  l)-visible  for  every  d  >  0,  that  is  di(A,  971(A))  =  0.  We  also  show 
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explicit  upper  bounds  for  Ci(6,  A,dJl{A))  for  S  >  0.  The  weights  w  are  subject  to  some 
regularity  conditions. 

In  particular,  the  analytic  Wiener  algebra  which  admits  no  norm  control  for  the 

inverses  for  0  <  5  ^  1/2  (see  Section  5),  turns  out  to  be  the  only  exception  in  the  scale  of 
weighted  algebras  up  to  the  weight  regularity  mentioned  above.  The  reason  for 

this  difference  lies  in  a  sort  of  “asymptotic  compactness”  of  the  algebra  multiplication  in  the 
presence  of  a  non-trivial  weight  regularly  tending  to  infinity,  and  in  its  absence  for  the  un¬ 
weighted  case;  see  comments  to  theorem  7.2.4  for  the  definitions.  Technically,  the  phenome¬ 
non  mentioned  is  measured  by  the  compactness  of  the  embedding  of  the  algebra  P(Z+,  w)  or 
/P(Z,  w)  in  the  multiplier  algebra  mult  {l^{w'))  of  the  space  of  derivatives  l^(w')  -  VIp(w), 
as  is  required  by  the  method  of  Section  3. 

7.1.  The  Beurling- Sobolev  algebras  P{w) 

We  start  by  fixing  notation  and  recalling  some  basic  facts  on  the  weighted  convolution 
algebras  P{w),  and  on  the  Beurling-Sobolev  algebras;  by  our  definition,  the  latter  represent 
a  special  case  of  P{w).  Then  we  describe  regularly  growing  weights  generating  a  Beurling- 
Sobolev  algebra. 

7.1.1.  The  topological  Banach  algebras  Ip(Z,w),  For  a  positive  function  (a  weight) 
w  :  Z  — ^(0,  oo)  and  an  exponent  1  ^  p  <  oo,  we  denote 

F{Z,w)  =  {x^  (x„)nez  :  tw  £  F{Z)}  =  {x  :  ||x||p,u;  <  oo}, 

where  ||x||p,u)  =  (Enez  <  oo,  with  the  usual  modification  for  p  =  oo, 

=  'sup^^j^\xnw{n)\  <  00.  The  convolution  of  two  finitely  supported  sequences 
is  defined  in  the  usual  way,  x*y  =  (Ejtez  Xkyn-k)n^z-  h  is  well  known  that  the  convolution 
can  be  extended  to  a  continuous  multiplication  on  /^(Z,  w)  provided  that 

^  Mn)  YX'”' 

(71)  =  sup  >  ...  )  <00, 

where  p'  stands  for  the  conjugate  exponent,  1/p  +  1/p'  =  1-  The  case  p  =  1  is  classical.  For 
p  >  1,  the  sufficiency  of  the  continuous  analogue  of  (7.1)  for  the  weighted  space  IF(R,  w)  to 
be  a  convolution  algebra  was  proved  by  J.  Wermer  in  [W].  For  the  case  of  F{Z,  w),  the  suffi¬ 
ciency  of  (7.1),  as  well  as  the  necessity  of  it  in  the  case  p  =  oo,  was  proved  in  [N3]  (without 
knowing  about  Wermer’s  result).  In  [KL],  it  was  shown  that,  in  general,  (7.1)  is  not  necessary, 
but  for  an  even  {w{—n)  =  w{n))  and  log-concave  weight  w  this  condition  is  necessary  for 
any  p.  However,  a  practically  useful  necessary  and  sufficient  condition  for  regularly  varying 
weights  first  appeared  only  in  [ENZ],  see  below  theorem  7.1.5.  For  information  on  weighted 
convolution  IF  algebras  on  E”,  see  [KS]. 

Assuming  (7.1),  we  get  \\x  *  p||p,^  ^  C'p^^llxUp^^llplIp,^  for  every  x,y  £  w).  Hence, 

F(Z,  w)  becomes  a  unital  topological  Banach  algebra  with  the  unit  e  =  eo  =  ((5ofc)fcGZ-  For 
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p  =  1,  the  condition  w(0)  —  1,  w{n  +  A:)  <  w{n)w{k)  obviously  is  necessary  and  sufficient 
for  (Z,  w)  to  be  a  Banach  algebra.  It  is  an  exercise  to  show  that  for  p  >  1  the  norm  ||  • 
is  never  a  Banach  algebra  norm;  on  the  contrary,  for  p  =  1  every  topological  Banach  algebra 
weight  is  equivalent  to  a  Banach  algebra  weight. 

By  a  Beurling-Sobolev  algebra  we  mean  the  space  Ip{Z,w)  satisfying  condition  (7.1)  and 
endowed  with  the  norm  ||  •  We  recall  that  the  weighted  Banach  algebras  are  usually 
called  Beurling  algebras;  in  the  general  case  of  the  algebras  P{Z,  w),  there  are  reasons  for 
adding  the  name  of  S.  L.  Sobolev,  because  for  p  —  2  and  w{n)  =  (1  +  |n|)“  we  get  the 
standard  Sobolev  spaces  (algebras)  on  T  as  the  spaces  w)  of  Fourier  transforms. 

7.1.2.  Maximal  ideals.  A  necessary  condition.  Since  A  =  /^(Z,  w)  satisfies  axioms  (i) 
and  (ii)  of  Subsection  3.1,  the  spectrum  of  P{Z,w)  is  the  annulus  A(r’_,r+),  A(r_,r+)  ~ 
{A  6  C  :  ^  |A1  ^  r+(u;)},  where  r_  =  =  limfc_).„c5o >  0,  = 

r+(w)  -  <  oo. 

The  Gelfand  transform  is  given  by  x  i — >  x, 

x{X)  =  Tx{X)  =  ^XkX^,Xe  A(r_,r+). 

k^X 

Moreover,  it  follows  from  the  inclusion  w)  C  C'(A(r„,  r+))  that  the  algebra  /^(Z,  w) 

is  always  embedded  into  the  weighted  algebra 

=  {2; :  w\rt  +  Yh  \^i=\rX  <  oo}. 

k<0  k'^0 

This  provides  a  necessary  condition  for  P(Z,ii;)  to  be  an  algebra,  namely,  {r%lw[n))  e 
/p'(Z),  or,  equivalently, 


with  the  usual  modification  for  p'  =  00. 

7.1.3.  Remarks  and  notation.  With  appropriate  modifications,  similar  facts  are  true  for 
the  analytic  Beurling-Sobolev  algebras  /^(Z+,  In  this  Section,  we  consider  the  case  of 
positive  spectral  radius  only^  r+(iy)  >  0. 

In  the  case  of  /^(Z,  ly),  we  often  assume  a  sort  of  weak  symmetry  of  w,  requiring  that 
r_  =  r_)..  Without  loss  of  generality,  we  can  normalize  the  weight  so  as  to  have  7*+  =  1.  A 
general  method  for  obtaining  algebras  /^(Z,  w)  with  arbitrary  spectral  radii  is  as  follows.  Let 
/^(Z,  w)  be  a  Beurling-Sobolev  algebra,  and  R  =  (i?_,  R^)  be  two  positive  numbers  such 
that  R^  ^  Then,  if  we  set  WR{n)  =  i^Sgn(n)^(^)  n  €  Z,  we  get  an  algebra  weight 
satisfying  ^  and  T±{yVR)  =  R±r±.  In  particular,  starting  with  r_  =  r+  =  1,  we 

can  obtain  an  arbitrary  pair  of  spectral  radii. 
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Notation.  We  use  the  symbol  respectively  as  common  notation  for  P(Z)  and 
respectively  for  /^(Z,  w)  and  /^(Z+,  id). 

7.1.4.  The  Beurling-Sobolev  algebras  l^[w).  Here  we  give  regularity  conditions  on  w 
guaranteeing  that  is  a  Beurling-Sobolev  algebra. 

Theorem  7.1.5.  I  Let  1  ^  p  ^  oo,  and  v  =  \og{w)  be  a  positive  eventually  concave 
function  on  E+  such  that  =  lima;_^oo  =  1-  Assume  that  the  following  condition 

is  fulfilled:  either,  (a)  there  exists  a  >  0  such  thatw(x)  jx^  eventually  decreases;  or,  (b)  there 
exists  a  >  1/p'  such  that  w{x)/x^  eventually  increases  and  has  concave  logarithm. 

Then  the  following  assertions  are  equivalent. 

(i)  The  space  /^(Z+,  w)  is  an  algebra. 

(ii)  The  space  /^(Z+,  w)  is  a  Beurling-Sobolev  algebra. 

(Hi)  1/w  e  Z/'(R+),  or,  equivalently,  (l/if;(n))„^o  ^  • 

IL  Let  It;  be  a  weight  function  on  Z  such  that  r_  =  —  1.  Assume  that  (u;(7i))„^o 

and  {w(—n))n^o  are  quasiincreasing  sequences,  that  is  sup„j^o  +  /)  < 

similarly  for  {if;(“n))„^o-  Then  P(Z,  w)  is  a  Beurling-Sobolev  algebra  if  and  only  if  so  are 
lP{Z+,w)  and  P{Z„,tn). 

7.1.6.  Examples,  (i)  l^{\n\^)  is  a  Beurling-Sobolev  algebra  if  and  only  if  ap'  >  1  for 
p  >  1,  and  0  forp  =  1.  Here  x^  =  max(3:,  1)  forx  £  R. 

(ii)  Ifl^{wa)  is  a  Beurling  algebra,  Wa{n)  =  w{n)/\n\%  and  op'  >  1,  then  (^{w)  is  a 
Beurling-Sobolev  algebra. 

1.2.  When  is  the  embedding  l^{w)  C  mult  (VIp{w))  compact? 

As  was  shown  in  Section  3,  the  compactness  of  this  embedding  plays  a  crucial  role  in  the 
estimates  of  the  norms  of  inverses.  The  best  polynomial  approximations,  measured  by  the 
characteristics  of  this  embedding,  make  it  possible  to  control  the  norms  of  the  inverses.  This 
completes  the  program  proposed  in  Section  3  for  estimates  of  A,  QH(A))  in  the  case  of 
Beurling-Sobolev  algebras  A  ~  l^{w).  We  consider  the  special  case  p  =  1  of  Beurling  alge¬ 
bras  separately,  because  in  this  case  we  can  state  an  explicit  necessary  and  sufficient  condition 
for  compactness.  Similar  analysis  can  be  made  for  For  the  Beurling-Sobolev  algebras 

l^{w),  1  <  p  <  oo,  some  broad  sufficient  conditions  are  obtained.  In  this  Subsection,  we 
follow  the  paper  [ENZ]. 

The  case  of  the  group  Z  is  slightly  different  from  the  semigroup  Z+,  because  in  these  two 
cases  the  convolutions  differ  in  nature.  Namely,  on  Z,  to  form  (x  *  xj)n  we  add  products  XkVj 
with  k  and  j  both  unbounded,  and  for  this  is  not  the  case.  In  particular,  this  elementary 
remark  explains  the  difference  between  the  nature  of  the  weight  a(n)  =  ||en||muit  on  Z+ 
from  that  on  Z,  see  1. 1.5-1. 2.6.  For  instance,  for  any  analytic  Beurling-Sobolev  algebra 
A  =  /P(Z+,  w)  we  always  have  A  C  mult  (A'),  but  for  the  algebras  A  -  lP(Z,w),in  general, 
this  inclusion  fails. 
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As  before,  the  weight  w'  is  defined  by  the  formula  w'{n)  =  w[n)l\n\^,  where  |n|*  = 
max(|n|,  1).  It  is  clear  that  A’  =  VA  =  ^{w^)  for  A  =  l^{w). 

7.2.1.  Continuous  embeddings  l^{w)  C  mult  {l^{w‘)).  As  was  just  mentioned,  an  ana¬ 
lytic  Beurling-Sobolev  algebra  w)  is  always  contained  in  mult  w')). 

Theorem  7.2.2.  /f  P(Z+,  w)  is  a  Beurling-Sobolev  algebra,  then 
F(Z+,  w)  C  mult  u;')) 

and  the  norm  of  this  embedding  does  not  exceed  Cp^w. 

It  is  easy  to  see  that  for  the  group  case,  that  is,  for  w)  in  place  of  /^(Z^.,  w),  some 
more  growth  restrictions  on  w  are  required  for  the  inclusion  A  =  F{Z,w)  C  mult  {A')  = 
mult  (^^(Z,  u;')).  Indeed,  the  condition  c  =  sup;t(lkfcllmuit(^o/ll^*llp.“)  <  oo  is  necessary 
for  such  an  inclusion;  it  implies  that  w{k)w(—k)  ^  const  •  |/!:|*  for  all  /c  G  Z. 

Thus,  for  every  1  ^  p  <  2  there  exist  regularly  growing  weights  w  satisfying  the  condi¬ 
tion  (iii)  of  theorem  7.1.5  but  c  =  oo  (take  w{k)  =  with  1/p^  <  a  <  1/2).  Consequently, 
there  are  algebras  P(Z,  w)  for  which  the  embedding  /^(Z,  w)  C  mult  (/^(Z,  w'))  fails.  On 
the  contrary,  for  p  ^  2,  any  algebra  /^(Z,  u;)  with  regularly  growing  weight  seems  to  be 
contained  in  mult 

On  the  other  hand,  we  can  guarantee  the  embedding  in  question  (and  even  the  compact¬ 
ness  of  this  embedding),  if  we  require  a  stronger  algebra  condition.  Below,  we  present  some 
results  from  [ENZ]  obtained  by  using  two  different  approaches. 

First,  we  give  a  simple  sufficient  condition  for  the  embedding  ll{w)  C  mult  (/^(ly')) 
factorizing  it  through  an  auxiliary  -space. 


Theorem  7.2.3,  I  For  any  weighted  space  A  —  l^(w),  or  A  —  Iq{w),  we  have 


where 

(7.2) 


||^n!lmult(i4')  — 


a{n)  =  sup 

k 


\k\^w{k  +  n) 
\k  +  n\^w{k)' 


k,n  e  Z,  or  k^n  e  Z+,  respectively.  Hence,  we  have  the  contractive  embedding  (a)  C 
mult  (A'). 

11.  The  embedding  IP  (w)  C  is  equivalent  to 


(7.3)  -el”'. 

W 

Being  valid,  this  embedding  is  automatically  compact  for  p  >  1,  and  it  is  compact  for  p  =  1 
if  and  only  if 
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where  k  — y  oo  in  the  case  of  Z+,  and  |A:|  — >  oo  in  the  case  of  Z. 

IIL  For  every  weighted  space  l^{w),  condition  (7.3)  for  p  >  1,  and  condition  (7.4)  for 
p~l,  imply  that  lP{w)  Cc  mult  {[^(w’)). 

IV.  For  a  Beurling  algebra  A  =  /^{Z+,  w)  the  following  are  equivalent 

a)  the  embedding  A  C  B  =  mult  (A^)  is  compact, 

b) l\Z^,w)cF(Z^,a)is  compact, 

C)  lim„,A:— >oo  =  0, 

d)  the  multiplication  in  F  (Z+,  w)  is  asymptotically  compact. 

Moreover,  if  Aq  -  {x  e  A  :  \\x\\a  ^  1}  is  the  unit  ball  of  A  =  F{Z+,w),  then 
en(Ao,B)  =  supk^j^(a(k)/w(k)}  where  B)  stands  for  the  best  polynomial  approxi¬ 

mation  of  degree  n  as  defined  in  3.2.3. 

“Asymptotically  compact  multiplication”  mentioned  above  means  that  for  every  e  >  0 
there  exists  an  integer  N  such  that  ||a;  *  y\\  <  e\\x\\  •  ||?/||  for  every  x,y  E  w)  satisfying 

Xk  =  yk  =  0  for  0  ^  k  <  N . 

The  conditions  a),  b)  and  d)  are  still  equivalent  (after  obvious  modifications)  for  Beurling 
algebras  /^(Z,  w)  on  Z,  but  the  behaviour  of  the  weight  a  is  quite  different  in  the  cases  of  Z 
and  Z+.  Examples  of  weights  w,  for  which  the  rate  of  the  best  polynomial  approximations 
e„(Ao,  B)  can  be  computed  explicitly,  can  be  provided  using  the  known  Hardy  field  of  func¬ 
tions,  see  Bourbaki  [Bou].  In  particular  such  a  weight  satisfying  lim^c— >oo  log(^(3^))  =  0 
generates  an  algebra  F{Zj^^  w)  and  satisfies  the  following  dichotomy: 

(i)  either\imx^oo{w{x)/x)  =  0,  and  then  a{n,w)/w{n)  ~ 

(ii)  or\imx^oo{w{x) /x)  >  0,  and  then  a(n,w)/w(n)  ~ 

7.2.4,  The  best  polynomial  approximations  related  to  the  embedding 

F{w)  C  mult  {F(w')) 

It  is  the  key  point  of  our  approach.  Knowing  em(Ao,  B),  it  remains  only  to  apply  the  theory 
of  Section  3.  As  before,  the  cases  of  Z+  and  Z  are  slightly  different. 

Theorem  7.2.5.  Let  1  <  p  ^  oo;  suppose  that  v  =  log(^i;)  satisfies  the  conditions 
(a),  (b),  and  (Hi)  of  theorem  7.1.5.  Then  the  space  F(Z^,w)  is  an  algebra,  the  embedding 
A  =  /P(Z+,  w)  C  B  =  mult  (F(Z,+,  w'))  is  compact,  and  the  following  estimates  are  valid 
for  the  best  approximations  em(Ao,B)  of  the  unit  ball  Aq  C  A. 

If  condition  (a)  is  satisfied  with  an  exponent  a  ^  1,  then 

j>m 

If  condition  (b)  is  satisfied,  then  ejn(Ao,B)  ^  c/m  for  a  >  1  +  1/p';  CmiAo^B)  ^  c  • 
(log(m))^/^7^  for  q;  =  1  +  1/p',  and  ejn(Ao,B)  ^  for  1/p'  <  a  <  1  +  1/p', 

where  c  stands  for  a  constant  depending  on  a  and  p. 
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As  is  mentioned  above,  for  the  case  of  Z  we  have  some  extra  constraints  in  order  that  the 
embeddings  in  question  would  be  compact.  For  instance,  the  condition 

lim  (i|e„||B/||e„|U)  =  0 

in|— >0 

is  obviously  necessary  for  A  =  Ip{Z,w)  Cc  B  —  mult  {P{Z,w')).  Since  \\en\\B  —  (^{n)  ^ 
|n|*i<;(0)/iti(-n),  we  obtain  that  if  lim|„[  .  ^^\n\w(n)w(-n)  >  0,  the  embedding  /^(Z,  w)  C 
mult  (/^(Z,  n;'))  cannot  be  compact. 

Theorem  7.2.6.  Letw  be  a  weakly  symmetric  nonnalized  weight,  that  is  =  r_  —  1. 
Each  of  the  following  conditions  implies  that  P{Z,w)  is  a  Beurling-Sobolev  algebra  com¬ 
pactly  embedded  into  the  multiplier  space,  A  =  l^{Z,w)  Cc  B  =  mult  {P{Z,w')),  with  the 
following  upper  bounds  for  the  best  polynomial  approximations  em(Ao,  B)  of  the  unit  ball 
Ao  C  A.  Here  Ern{p,  oi)  stands  for  the  right  hand  side  of  the  corresponding  inequality  in 
theorem  7.2.5  if  a  >  1,  and  Sm(p,  a)  =  c  •  otherwise. 

If  Ci^wa  <  oo  for  an  exponent  a  >  2“^(1  +  1/p'),  where  Wa  =  {'d){k)/\k\'^)k^o,  then 
B)  ^  Cl,wa^m{P) 

If  ^PyWa  ^  and  Q!  1/2,  then  6jji{^Aq,  B^  ^ 

For  slowly  increasing  weights,  we  refer  to  [ENZ]  for  a  simple  direct  estimate  for  the  best 
approximations  em{Ao,B). 

7.3.  Some  explicit  estimates  of  the  norm  controlling  constants  ci  ((^,  l^{w)) 

Now  we  are  ready  to  obtain  explicit  estimates  of  the  inverses  in  the  Beurling-Sobolev 
algebras.  To  this  end,  we  combine  theorems  of  Section  3  with  the  estimates  of  the  rate  of 
polynomial  approximations  €^(^0,  B)  provided  in  Subsections  7. 1-7.2.  Recall  that  the  ma- 
jorant  M  of  theorem  3.2.5  depends  on  the  distribution  function  Aq  of  the  sequence  em{AQ,  B) 
and  on  the  multiplier  norms  a{k)  of  the  basis  vectors  of  In  7.3.1  below  we  supposing 

=  r_  =  1;  for  the  case  where  r_  <  r+,  see  the  remark  at  the  end  of  this  Subsection. 

Theorem  7.3.1.  Let  A  be  a  Beurling-Sobolev  algebra,  A  =  /J(Z,  w)  or/J(Z+,  w),  com¬ 
pactly  embedded  into  B  =  mult  {P(w')).  Then  (5i(A,  971(A))  =  0,  and  for  all5  >  0  we  have 
the  estimate  ci{S,  A,^Xfl(A))  ^  'u;(0)<5“^  +  Mi{S),  where  M  is  given  in  theorem  3.2.5  and 
W^jWb  =  cr(j)  (see  formula  (7.2)),  and  the  constants  S  and  C  depend  on  the  constant  (7.1) 
and  on  the  norm  of  the  embedding  w)  C  B  =  mult  {P{w^)). 

7.3.2.  Examples  of  estimates  of  inverses  on  Z^.,  Here  are  some  typical  Beurling- 
Sobolev  algebras  A  =  /^(Z^.,  w),l  ^  p  ^  oo. 

(i)  w{n)  =  a  >  1  1/p'.  Then,  Ci(5, /P(Z+,nJ'))  ^  c(a:,p)/<5^®+^. 

(ii)  w(n)  =  nj,  q;  =  1  +  1/p'.  Then,  /^(Z+,nJ))  ^  c(q:, p)(log(  1/5)) 

(Hi)  w(n)  —  n^,  1/p'  <  a  <  1  +  1/p'.  Then,  ci(5, /p(Z+,  n*))  ^  c(a,p)/5^^+\  where 
P  =  (op'  -  l)-f 
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(iv)w{n)  =  e'*“,0  <  a  <  1.  Then,  ci{S,l’’(Z+,w))  $  c{a,p)  ■  exp{dQ(5  ">}  -(5  ^  • 

7.3.3.  Examples  of  estimates  of  inverses  on  Comparison  of  the  cases  of  Z  and  Z^. 
given  in  Subsections  7.1  and  7.2  shows  that  the  explicit  estimates  of  the  inverses  for  rapidly 
growing  weights  should  be  the  same  on  Z  and  on  Z+,  up  to  the  sharp  values  of  constants. 
Here,  “rapidly”  means  at  least  as  fast  as  the  linearly  growing  weight  w{n)  =  |n|*,  n  G  Z.  For 
slower  weights,  e.g.,  for  i/;(n)  —  |n|“,  a  <  1,  our  method  gives  faster  growth  of  the  constants 
Cl (5,  lp{w))  on  Z  than  on  Z+.  This  method  stops  completely  at  the  exponent  o  =  |(1  +  ^). 
For  this  case  we  refer  to  the  recent  paper  [E],  where  a  different  approach  based  on  a  Bjork’s 
paper  [B]  is  employed.  We  restrict  ourselves  to  a  few  examples  illustrating  the  above  theory. 

(i)  w{n)  =  |n|“,  a  >  1  +  1/p'  or  1  ^  o  ^  1  +  1/p'.  Then  one  has  the  same  results  as  in 
7.3.2  {if  (ii)  and  (Hi),  with  modified  constants. 

(ii) w{n)  =  |n|?,2-i(l+l/p')  <  a  <  1.  Thenci(5, /'’(Z,  |7i|?))  ^  c(a,p)/(52+2('+«P-»). 

Observe  that  the  right  hand  side  of  the  last  inequality  diverges  to  infinity  for  a  — > 

2“^(1  +  1/p')  =  ap,  because  — >  oo.  However,  in  order  to  get  a  finite  majorant,  we 
can  consider  a  weight  growing  slightly  faster  than  the  critical  weight  w{n)  =  |h|*^.  This  can 
be  done  using  ?rj(n)  =  |n|?'’(log(l  +  |nl*))^. 
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Abstract.  A  wide  variety  of  questions  of  Harmonic  Analysis  arise  naturally  in  var¬ 
ious  contexts  of  Analytic  Number  Theory;  in  what  follows  we  consider  a  number  of 
examples  of  this  type. 

The  author  is  grateful  to  Dr.  Ulrike  Vorhauer  for  advice  and  assistance  at  all  stages 
of  preparation  of  this  paper. 

I.  Uniform  Distribution 

The  definition  of  uniform  distribution  is  fairly  intuitive: 

Definition  1.  A  sequence  {u„}  g  T  w  uniformly  distributed  ij/orfltny  a,  0  ^  a  <  1, 
we  have  ^ 

lim  cardfl  <  n  ^  iV  :  Wn  €  [0,  a)  (mod  1)}  =  a. 

N^oo  JM 

Let  Un  be  a  measure  with  unit  masses  at  the  points  for  1  ^  n  <  TV .  Then  the 
Fourier  transform  of  Un  is  the  exponential  sum 

N 

UN(k)  =  J2e{-ku„) 

n=l 

where  e(^)  =  .  (This  notation  was  introduced  by  1.  M.  Vinogradov.)  H.  Weyl  [36,37] 

introduced  an  important  criterion  for  uniform  distribution  in  terms  of  the  size  of  the  Un  , 
namely  that  the  following  are  equivalent  statements  concerning  a  sequence  {u„}  : 
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(a)  The  sequence  {n„}  is  uniformly  distributed; 

(b)  For  each  integer  k  ^0,  U^ik)  =  o{N)  as  N  ^  oo\ 

(c)  If  F  is  properly  Riemann-integable  on  T  then 

From  the  formula  for  the  value  of  a  geometric  series  it  is  immediate  that 


N 


"^eine) 


n=l 


1 

""  IsinTT^I 


1 

m 


where  ||^||  is  the  distance  from  0  to  the  nearest  integer,  ||^||  =  min„^z  \0  —  n\.  If  0  is 
irrational  and  fc  is  a  non-zero  integer  then  kO  is  not  an  integer,  and  hence  by  the  above  with 
9  replaced  by  kO  we  see  that  UN{k)  =  Ojt(l)  when  Un  =  n9 .  In  this  way  we  see  easily 
that  the  sequence  n9  is  uniformly  distributed  if  9  is  irrational. 

The  proof  of  Weyl’s  Criterion  depends  on  the  existence  of  one-sided  approximations  to 
the  characteristic  function  Xj  interval  by  trigonometric  polynomials;  these  approxima¬ 
tions  should  be  close  in  the  norm.  Of  course  the  existence  of  such  trigonometric  polyno¬ 
mials  follows  easily  from  the  uniform  approximation  to  continuous  functions  by  trigonomet¬ 
ric  polynomials,  but  it  is  also  useful  to  put  this  in  a  quantitative  form.  Erdos  and  Tur^n  [9] 
showed  that  there  exist  trigonometric  polynomials  T_  and  T+  of  degree  at  most  K  such 
that  T-{x)  ^  Xj{^)  ^  ^+(^)  for  all  x  and  such  that  fjT±  =  a -h  0(1/K) ,  and  thus  that 


N 


K 


card{l  ^  n  ^  N  :  Un  E  [0,  a)  (mod  1)}  -  Na  ^  C—  -j-  C  y~] 


mk)\ 


k=l 


In  the  1970’s  Selberg  considered  how  the  large  sieve  could  be  refined,  and  in  doing  so  discov¬ 
ered  more  natural  functions  that  yield  very  sharp  constants  (see  Selberg  [32],  pp.  213-226). 
Indeed,  Beurling  [5]  defined  the  entire  function 


B(z) 


.  „  „  OO  -  OO  - 

/sm7r2:\2/2  ^  1  ^  1  \ 

\  TT  /  \  2:  ^  iz  —  n)^  [z  -f  n)2  / 

n=0  ' 


nY  "  (2  -f  n)2 


and  observed  that  B{z)  =  ,  that  B{x)  ^  sgn(a;)  for  all  real  x ,  and  that 

J'  B{x)  -  sgn(a:)  dx  =  \  . 

K 

Indeed,  Beurling  showed  that  among  functions  with  the  prior  properties,  this  function  unique¬ 
ly  minimizes  the  above  integral. 
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Figure  1.  Beurling’s  function  B{x) . 


If  /  =  [a,  6]  is  an  interval  of  the  real  line  and  (5  >  0  then  the  Selberg  majorant  and 
minorant  are 


S^{z)  =  \B{5{z-a))  +  \B{5{b-z)), 

S^{z)  =  -\B{5{a-z))-\B{S{z-b)). 

Then  S^{x)  ^  <  S^{x)  for  all  real  x ,  S{t)  =  0  for  \t\'^  6,  and  S±(x)  dx  = 

b  ~  a±l/6.  This  approximation  is  optimal  when  6(b  -  a)  is  an  integer,  and  in  any  case 
is  quite  good.  To  obtain  corresponding  approximations  for  the  circle  group  we  apply  the 
Poisson  summation  formula.  Suppose  that  b  -  a  <  1 ,  so  that  the  interval  [a,  b]  defines 
an  arc  of  T.  We  take  5  ==  iiT  +  1  and  set  T±{x)  =  En‘5'±(^  +  t)  .  Then  T±  is  a 
trigonometric  polynomial  of  degree  not  exceeding  K,  T^(x)  <  x^(a:)  <  T+{x)  for  all  x, 
and  T±{x)dx  =  b-a±  1/ {K  -\-l) .  This  is  optimal  when  {b  -  a)  {K  +  1)  is  an  integer, 
and  is  close  to  optimal  in  any  case. 

WeyPs  criterion  has  since  been  vastly  generalized  to  describe  the  weak  convergence  of  a 
sequence  of  measures  to  a  limiting  measure  fj,  in  terms  of  the  convergence  of  the  Fourier 
transforms  of  these  measures.  One  fruitful  generalization  is  to  T* .  Suppose  that  {«„}  is  a 
sequence  of  points  in  and  let  ^  =  [ai,  6i]  x  •  •  •  x  [nj,  ba]  denote  a  box  in  .  Then  the 
following  are  equivalent: 

(a)  limAT^oo  card{l  ^  n  ^  N  :  Un  e  "B}  —  yoI'B  for  every  box  !B  in  ; 

(b)  If  k  k^O,  then  J2n=i  ’  '^n)  =  o{N)  as  TV  oo ; 

(c)  If  F  is  properly  Riemann-integrable  on  then  limjv-^oo  - 

fjd  • 
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Quantitative  majorants  in  are  easily  obtained  by  forming  a  product  of  one-dimen¬ 
sional  majorants.  Minorants  are  a  little  more  elusive,  but  Barton,  Vaaler  and  Montgomery  [2] 
have  given  a  construction  that  works  pretty  well. 

In  the  same  way  that  we  used  Weyl’s  Criterion  to  see  that  the  sequence  {n9}  is  uniformly 
distributed  if  6  is  irrational,  we  can  use  Weyl’s  Criterion  for  to  obtain  a  sharpening  of 
Kronecker’s  Theorem.  Suppose  that  1,  ^i, . . . ,  are  linearly  independent  over  the  field  Q 
of  rational  numbers.  Then  the  points  nO  -  (n^i,  n02,  •  •  • ,  are  not  only  dense  in 
(Kronecker’s  Theorem)  but  are  actually  uniformly  distributed.  Indeed,  suppose  that  ^  is  a 
box  in  ,  and  let  T{x)  be  a  trigonometric  polynomial  that  minorizes  with  /jpd  T  >  0 . 
Then 

Af+N 

card{M  -hl^n<M  +  iV:n^G'B}^  ^  T(n0) 

n=Ar+l 


k  n=M+l 


^Nf(0)-lYl 


\T{k)\ 

ilfc.flir 


Here  the  last  sum  is  finite  because  there  are  only  finitely  many  k  for  which  T{k)  ^  0  and 
because  k  •  9  is  never  an  integer.  Since  f  (0)  >  0 ,  the  above  is  positive  if  N  is  sufficiently 
large.  The  remarkable  feature  here  is  that  the  expression  above  is  independent  of  M .  That  is, 
if  rii  <  n2  <  na  <  *  •  •  are  the  n  for  which  not  e'B  then  the  gaps  Tii+i-rii  are  uniformly 
bounded  above.  This  insight  is  critical  to  Bohr’s  definition  of  almost  periodicity. 


Definition  2.  Let  f  :  'R  C  be  continuous.  We  say  that  a  real  number  t  is  an  e 
-almost  period  of  /  if  \f{x  +  t)  -  f{x)\  <  e  for  all  real  x.  The  function  f  is  almost 
periodic  if  for  every  e  >  0  there  is  a  number  C  =  C(e)  such  that  any  interval  of  length  at 
least  C  contains  an  e  -almost  period. 

It  is  by  the  strengthened  form  of  Kronecker’s  Theorem  that  we  see  that  the  almost  periodic 
polynomial 

N 

T(x)  —  ^  ^  Qn^(^n^) 

n=l 

is  indeed  an  almost  periodic  function.  It  can  also  be  shown  that  the  almost  periodic  poly¬ 
nomials  are  dense  in  the  space  of  almost  periodic  functions.  Let  C(s)  = 

Riemann  zeta  function.  We  write  s  =  a-fit .  If  a  is  fixed,  a  >  1 ,  then  (^(cr  -fit)  is  an  almost 
periodic  function  of  t .  The  concept  of  almost  periodicity  can  be  generalized  to  other  norms. 
For  example,  when  a  is  fixed,  1/2  <  a  <  1  the  function  +  it)  is  a  mean-square  almost 
periodic  function,  even  though  it  is  not  an  almost  periodic  function  in  the  uniform  norm.  One 
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might  expect  that  by  almost  periodicity  one  could  show  that  if  the  zeta  function  has  a  zero 
with  real  part  >1/2  then  it  would  have  many  such  zeros,  all  with  approximately  the  same 
real  part.  However,  attempts  to  prove  such  an  assertion  have  so  far  been  unsuccessful.  If  one 
could  prove  such  a  thing  then  the  Riemann  Hypothesis  would  likely  follow,  since  we  have 
fairly  good  upper  bounds  on  the  number  of  zeros  of  the  zeta  function  with  large  real  part.  Let 
N(a,  T)  denote  the  number  of  zeros  of  in  the  rectangle  O^t^T.  Then 

N{a,T)  <C  T^^^^logT  where  0((7)  <  1  if  cr  >  1/2.  (Following  Vinogradov,  we  say  that 
f  g  if  f  ~  0{g) .  This  notation  saves  a  set  of  parentheses  when  there  is  no  main  term.) 
Although  we  have  not  succeeded  so  far  to  use  almost  periodicity  to  produce  many  zeros  from 
one,  we  do  know  that  translates  of  the  zeta  function  are  universal  among  non-zero  analytic 
functions,  in  the  following  sense:  Let  iR  be  a  rectangle,  01=  {s  :  ai  ^  cr  <  (72,  i  ^  ^2} 
with  l/2<cri<(72<l.  Let  /  be  analytic  and  non-zero  on  a  domain  containing  01.  Then 
for  any  £  >  0  there  exists  a  real  number  r  such  that  |/(5)  -  C(s  4-  ir)|  <  e  uniformly  for 
s  €  3^. 

almost  periodicity  also  arises  in  the  error  term  in  the  Prime  Number  Theorem.  Let  A(n) 
be  von  Mangoldt’s  lambda  function,  which  is  to  say  that  A(n)  =  logp  when  n  =  for 
some  positive  integer  k,  and  A(n)  =  0  otherwise.  We  put  ^(a:)  =  I3„<a,A(n).  By 
integration  by  parts  we  see  that  the  Prime  Number  Theorem  in  the  form  7r(i)  rsj  li(x)  is 
equivalent  to  the  assertion  that  ip{x)  X.  Since  -C^\s)/C^{s)  =  A(n)n  for  a  >  1 , 
we  can  recover  'ip{x)  from  the  logarithmic  derivative  of  the  zeta  function  by  an  inverse  Mellin 
transform: 


pc+ioo 
J  C— *00 


C' 


(s) 


By  moving  the  contour  to  the  left  we  see  that  this  is 


=  X  — 


c 

c 


1  °° 


Here  p  runs  over  all  the  non-trivial  zeros  of  the  zeta  function,  which  is  to  say  all  those  zeros 
with  positive  real  part.  This  explicit  formula  for  the  error  term  in  the  Prime  Number  Theorem 
is  essentially  one  that  Riemann  stated  and  von  Mangoldt  later  proved.  Write  p  =  ^  -h  ^7 .  In 
the  quantity  x^  =  ,  the  second  factor  oscillates,  but  more  slowly  as  x  increases.  To 

make  it  periodic  we  make  an  exponential  change  of  variables.  Suppose  also  that  the  Riemann 
Hypothesis  is  true,  which  is  to  say  that  ^  =  1/2  for  all  p.  Then 


^(e^)  — 
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This  expression  is  mean-square  almost  periodic,  and  the  sum  on  the  right  is  its  Fourier  ex¬ 
pansion.  It  is  generally  believed  that  the  7  >  0  are  linearly  independent  over  Q ,  so  that  the 
terms  behave  like  independent  random  variables. 

2.  Exponential  Sums 

In  his  seminal  work  [36,37],  Hermann  Weyl  not  only  gave  his  criterion  for  uniform  distri¬ 
bution,  but  also  a  useful  method  for  estimating  exponential  sums  of  the  form  X)n=i  ^(^(^)) 
where  P  is  a  polynomial  with  real  coefficients.  Indeed,  today  such  an  exponential  sum  is 
called  a  Weyl  sum.  Weyl’s  basic  observation  was  that 

I  E  =  E  E  e(FH  -  P(n)) 

n=l  n=:l  m=l 

=  E  E  e{P(n  +  h)-P{n)) 

n=l h=\-n 

=  E  E  e{P{n  +  h)  -  P{n)) 

/i=_N+l 

1-h-^n^N-h 
N-l  N-h 

=  23?  ^  ^  e(P(n  +  h)  ~  P{n)). 

h=l  n=l 

This  manipulation  is  known  as  ‘Weyl  differencing’.  If  P(x)  has  degree  d  then  P(x  h)  - 
P{x)  has  degree  d  -  I  when  /i  ^  0 .  Hence  if  we  perform  this  differencing  d~  I  times 
then  we  are  left  with  a  geometric  series,  which  we  know  how  to  estimate.  In  this  way,  Weyl 
showed  that  if  P{x)  —  Ciix'  is  a  polynomial  with  real  coefficients,  and  if  at  least  one  of 
the  numbers  ai,  02, .  is  irrational,  then  the  sequence  {P(ri)}  is  uniformly  distributed 
modulo  1 . 

In  the  Weyl  differencing,  it  is  somewhat  a  disadvantage  that  the  parameter  h  runs  all  the 
way  from  1  to  -  1 .  Later,  van  der  Corput  found  that  h  can  be  restricted.  For  1  N 

let  yn  be  a  complex  number,  and  suppose  that  i/„  =  0  if  n  <  1  ot  n  >  N .  Then 

(^  +  1)"|E2'"|  =1EE2'"+'‘ 

n  h=0  n 

=  |EE2'"+'- 

n  h—0 
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By  Cauchy’s  inequality  this  is 

I  ^  2 

^{N  +  H)'£\'Eyr,+h 

n  /i=0 
H  H 

=  {n  +  h)^Y.I1  Vn+hyn+k 

h=0  k=0  n 

=  (N  +  H){H  +  l)'£\yn\'‘ 

n 

H 

+  2(iV  +  H)3fi  +  1  -  r)  ^  . 

T=\  n 


Thus  we  obtain  van  der  Corput’s  inequality,  which  asserts  that 


f  r 

n=l 


N-\-H 
H  +  1 


n=l 


2(Ar  +  ff)  "  / 


h 


N-h 


yn+hyn\  ■ 


On  taking  =  e{kun)  and  applying  Weyl’s  Criterion  twice,  we  obtain 


Theorem  1.  (van  der  Corput)  If  for  each  positive  integer  h  the  sequence  {un+h  —  Un} 
is  uniformly  distributed  (mod  1 ),  then  the  sequence  {%}  is  uniformly  distributed  (mod  1 ). 

More  recently  it  has  been  recognized  that  the  above  remains  true  even  when  h  is  re¬ 
stricted  to  lie  in  certain  subsets  of  positive  integers;  we  say  that  is  a  van  der  Corput  set 
if  the  above  is  true  when  h  is  restricted  to  lie  in  This  is  equivalent  to  the  existence  of 
non-negative  cosine  polynomials 

T{x)  —  flo  +  ^  o-h  cos  27rhx 

l^h^H 

with  r(0)  =  1  and  ao  arbitrarily  small.  In  this  context  it  is  no  accident  that  we  see  the 
coefficients  of  the  Fejer  kernel  in  van  der  Corput’s  inequality.  Since  the  set  of  perfect  squares 
constitute  a  van  der  Corput  set,  there  exist  non-negative  cosine  polynomials  of  the  form 

H 

T{x)  =  do  +  ^  Cih  cos  2'Kh^x 

h=i 

with  T(0)  =  1  and  ao  arbitrarily  small,  but  it  is  not  known  how  rapidly  ao  tends  to  0  as 
H  -A-  oo. 
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van  der  Corput  devised  a  general  method  for  estimating  exponential  sums  of  the  form 
ip))  where  /  is  a  sufficiently  smooth  real  valued  function.  As  an  example  of  a 
simple  first  result  of  this  type,  we  mention  that  if  0  <  A2  ^  f"{x)  ^  AX2  for  a  ^  x  ^  b 
then 

^  e(/(n))«^Af(6-a)  +  A2'/^ 

a^n^b 

In  Process  A  of  van  der  Corput,  one  exponential  sum  is  made  to  give  rise  to  another  by  a 
suitable  application  of  the  van  der  Corput  inequality.  This  method  is  destructive  in  the  sense 
that  usually  some  cancellation  is  lost.  In  van  der  Corput’s  Process  B,  one  takes  r(a:)  = 
e{f{x))  for  a  ^  X  ^  and  r(j:)  =  0  otherwise.  Then  by  the  Poisson  summation  formula, 

'■(")  =  =  X!  /  e(f(x)-vx)dx. 

If  f  is  strictly  increasing  with  /'(a)  =  a,  /'(6)  -  then  for  a  ^  1/  ^  we  obtain  a 
stationary  phase  at  x^  where  r\x^)  -  u.  If  f"(x)>0foTa^x^b  then  the  above  is 
approximately 

Thus  the  problem  of  estimating  one  sum  is  reduced  to  that  of  estimating  another.  Process  B  is 
non-destructive,  since  a  second  application  of  it  takes  us  back  to  our  initial  sum.  For  a  certain 
class  of  functions  / ,  these  two  processes  lead  to  estimates  of  the  form 

^  e(/(n))  <  (max|/'|)*(6-  a)' 

for  certain  pairs  (k^i)  in  the  square  0  ^  ^  1/2  <  ^  ^  1 .  If  (^,  ^)  is  an  exponent  pair 

then  Process  A  gives  the  exponent  pair  {2^,  ^2^  )  ’  Process  B  gives  the  exponent  pair 
{i  —  1/2,  k  +  1/2) .  It  is  trivial  that  (0, 1)  is  an  exponent  pair.  By  Process  B  it  follows  that 
(1/2, 1/2)  is  an  exponent  pair,  and  by  Process  A  this  yields  the  further  pair  (1/6, 2/3) .  The 
collection  of  exponent  pairs  that  can  be  obtained  in  this  way  is  indicated  in  Figure  2  below. 

Recently,  Huxley  [17],  building  on  work  of  Bombieri  and  Iwaniec  [6],  has  slightly  en¬ 
larged  the  region  of  known  exponent  pairs,  but  we  are  still  far  from  proving  the  conjecture 
that  {kj  £)  is  an  exponent  pair  if  k  >  0  and  £  >  1/2 .  This  is  a  quite  deep  conjecture, 
since  the  special  case  f{x)  =  flog  a:  yields  the  Lindeldf  Hypothesis,  which  asserts  that 
({1/2  +  it)  ^  f  for  every  e  >  0 .  Some  useful  exponent  pairs  are  given  in  Table  1. 

Quantitative  estimates  can  also  be  derived  for  the  Weyl  sum  ^(^(^))  terms  of 
the  rational  approximations  to  the  coefficients  of  P .  For  example,  by  Weyl’s  method  we  find 
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Figure  2.  Exponent  Pairs  derived  by  van  der  Corput’s  processes. 


that  if  P{x)  =  Yh=o  and  \ad  -  a/q\  <  l/q^  where  (a,  g)  =  1 ,  then 

Y,  («))  i^)  • 

n=l  ^ 

Here  the  most  favorable  circumstance  is  when  N  ^  q  ^  ,  in  which  case  the  upper 

bound  is  of  the  order  .  This  is  only  slightly  better  than  the  trivial  bound  if  d 

is  large,  and  it  falls  far  short  of  what  we  conjecture,  which  is  that 

n=l  ^ 

Here  the  most  favorable  situation  arises  when  q  «  ,  and  then  the  upper  bound  is  of  the 

order  ^ 

This  leads  to  a  non-trivial  estimate  for  \S(ot)\ . 
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Table  1.  Some  Exponent  Pairs. 


Operation 

(0,1) 

(1/254,247/254) 

AAAAAAB 

0/126,20/21) 

AAAAAB 

(1/86,161/172) 

AAAABAAB 

(1/62,57/62) 

AAAAAB 

(1/50,181/200) 

AAABABAAB 

(1/42,25/28) 

AAABAAB 

(2/53,181/212) 

AABABAAAB 

(1/24,27/32) 

AABABAAB 

(1/22,101/121) 

AABABABAAB 

(1/20,33/40) 

AABAAB 

(13/238,97/119) 

AABAABAAB 

(11/186,25/31) 

AABAAAB 

(4/49,75/98) 

ABABAAAB 

(11/128,97/128) 

ABABAABAAB 

(1/11,3/4) 

ABABAAB 

(1/10,81/110) 

ABABABAAB 

(13/106,75/106) 

ABAABAAB 

(11/86,181/258) 

ABAABABAAB 

(11/78,161/234) 

ABAAABAAB 

(22/117,25/39) 

BABAAABAAB 

(26/129,27/43) 

BABAABABAAB 

(11/53,33/53) 

BABAABAAB 

(13/55,3/5) 

BABABABAAB 

(1/4,13/22) 

BABABAAB 

(33/128,75/128) 

BABABAABAAB 

(13/49,57/98) 

BABABAAAB 

(19/62,52/93) 

BAABAAAB 

(75/238,66/119) 

BAABAABAAB 

(13/40,11/20) 

BAABAAB 

(81/242,6/11) 

BAABABABAAB 

(11/32,13/24) 

BAABABAAB 

(75/212,57/106) 

BAABABAAAB 

(11/28,11/21) 

BAAABAAB 

(81/200,13/25) 

BAAABABAAB 

(13/31,16/31) 

BAAAAB 

(75/172,22/43) 

BAAAABAAB 

(19/42,32/63) 

BAAAAAB 

(60/127,64/127) 

BAAAAAAB 

(1/2, 1/2) 

B 
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3.  Power  Sums 

Although  we  spend  a  lot  of  effort  to  estimate  exponential  sums,  in  the  opposite  direction  it  is 
sometimes  useful  to  show  that  a  cancelling  sum  is  not  always  too  cancelling.  Turan’s  method 
of  power  sums  provides  tools  of  exactly  this  sort.  Since  error  terms  in  analytic  number  theory 
are  often  expressed  as  a  sum  of  oscillatory  terms  (as  we  have  already  seen  in  the  case  of  the 
error  term  in  the  Prime  Number  Theorem),  Turan’s  method  assists  us  in  proving  that  such 
error  terms  are  sometimes  large.  To  exemplify  the  method,  we  describe  Turan’s  First  Main 
Theorem.  Let 

N 

n=l 

and  suppose  that  \zn\  >  1  for  all  n.  Suppose  that  M  is  a  given  non-negative  integer.  We 
wish  to  show  that  there  is2ii',M-hl^u^M  +  N,  such  that  |st,|  is  not  too  small 
compared  with  |so| .  To  this  end  we  employ  a  simple  duality  argument,  which  is  typical  of 
Turk’s  method.  Suppose  that  numbers  have  been  determined  so  that 

N-\ 

(3.1)  So  ~  ^  ^ 

i/=0 

Then 

N-l 

|soK  (  XI  • 

^  j/=0 

By  the  definition  of  we  see  that  (3.1)  asserts  that 

AT  N  N-l 

X  =  X  X  • 

n=l  n=l  i/=0 

This  identity  certainly  holds  for  arbitrary  bn  provided  that 

i/=0 

for  1  <  n  <  iV .  That  is,  P{z)  —  J2u=q  should  be  a  polynomial  of  degree  at  most 
N  ~1  that  satisfies  the  N  conditions  P{zn)  =  .  Without  loss  of  generality  the  Zn 

are  distinct,  and  hence  P{z)  is  uniquely  determined.  It  can  be  shown  that  ^ 

,  and  thus  we  find  that 
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The  constant  here  is  best-possible,  but  it  is  disappointingly  small,  since  it  is  only  a  little  larger 
than 

/  N 

\2e{MA-N))  ' 

Suppose  that  T{x)  is  an  exponential  polynomial  of  N  terms  and  period  1 ,  say 

N 

T{x)  =  ^6„e(A„a:) 

n=l 

where  the  A„  are  integers.  Let  /  be  a  closed  arc  of  the  circle  group  T ,  and  let  L  denote  the 
length  of  I ,  Then  by  Turin’s  First  Main  Theorem,  it  is  easy  to  show  that 


inax|T(i)|>(-)  max|r(x)|. 


Although  the  constant  here  depends  on  the  number  N  of  terms  in  T(j:)  ,  it  is  noteworthy 
that  it  is  independent  of  the  size  of  the  frequencies  A„  .  This  inequality  makes  it  possible  to 
give  a  simple  and  motivated  proof  of  the  Fabry  Gap  Theorem. 

The  small  constant  c(M,  N)  can  be  replaced  by  a  larger  constant  if  one  is  prepared  to 
allow  u  to  run  over  a  range  longer  than  N .  In  more  restricted  situations  the  lower  bound 
can  be  very  good  indeed.  For  example,  suppose  that 

N 

(3.2)  = 


N  N  K 


Tn=l  n=l  v—l 


Since  the  expression  is  real,  we  may  take  real  parts  to  see  that  the  above  is 


N  N  K 


~  ~  ^n)  —  ^(^K+li^m  ^n)  1) 


7n=l  n=l  i/=l 


where  Ak+i(^)  denotes  the  Fej6r  kernel.  Since  A;c+i(^)  ^  0  for  all  0,and  Ak+i(0)  — 
AT  -I- 1 ,  it  follows  that  the  above  is 
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Thus  if  {I -{■  s)N  then  |5^|  ^  C{e)y/N ,  said  in  particular 

max  |si,|  ^  \/N/2  . 

By  working  similarly  with  higher  moments  one  can  obtain  still  better  lower  bounds  over 
longer  ranges  of  I'l  If  is  given  by  (3.2)  and  1  ^  m  ^  N/2  then  there  is  a  i/ ,  1  ^  z/  ^ 
(12A^/m)”^  such  that  |5i/|  >  IVmN .  It  would  be  useful  to  have  examples  to  show  that  this 
is  close  to  best-possible. 

For  a  more  extensive  survey  of  Turan’s  method  one  may  consult  Montgomery  [21],  pp. 
85-107.  For  a  detailed  account  of  the  subject  and  important  applications,  one  should  see  the 
book  of  Turan  [33]. 

4.  Irregularities  of  Distribution 

We  now  consider  how  well-distributed  N  points  can  be.  If  the  points  are  to  fall  in  [0, 1]  then 
we  could  take  Un  =  n/N .  These  points  are  extremely  well-distributed  in  the  sense  that  the 
number  of  them  in  a  subinterval  [0,  a]  is  N a  -b  0(1) .  However,  we  find  that  it  is  not  so  easy 
to  distribute  points  well  in  .  Suppose  that  our  points  are  =  (izi,  U2) ,  let  5i(a)  be  the 
rectangle  51(a)  =  [0,  ai]  x  [0, 0:2] ,  and  let  D{a)  be  the  discrepancy  function 

D(a)  =  card{l  ^  n  ^  iV  :  G  Ik(a)}  -  Naia2  . 

Roth  [30]  used  a  construction  suggestive  of  wavelets  to  show  that 

/  D{a)'^  da^logN . 

Since  it  is  also  possible  to  construct  points  that  are  this  well-distributed,  this  solves  the  prob¬ 
lem  as  to  distribution  in  mean-square.  Ostrowski  [27]  had  observed  that  D{(x)  <C  logiV 
when  Un  =  {n/N,ny/2) .  The  problem  of  showing  that  in  any  case  ||-D||oo  >  logA^  was 
solved  by  Schmidt  [31]  by  means  of  a  complicated  induction.  However,  a  curious  difference 
arises  here.  Roth’s  argument  generalizes  easily  to  T*  to  show  that  \\D\\2  (log 
for  any  k  >  1.  However,  Schmidt’s  approach  has  not  been  extended  to  k  >  2.  It  has 
been  conjectured  that  ||i7l|oo  '>k  .  This  would  be  best  possible,  in  view  of  con¬ 

structions  of  Halton  [14].  On  the  other  hand,  Pollington  has  recently  mounted  a  wavelet 
approach  to  this  problem  that  has  led  him  to  conclude  that  one  should  be  able  to  show  that 
Halloo  (logiV)*=/2 ,  and  that  this  sup  norm  need  not  be  larger.  Hence  one  should  regard 
the  question  of  how  large  |li7||oo  need  be  to  be  a  wide  open  unsolved  problem  when  k>  2. 
Halasz  [13]  devised  a  variant  of  Roth’s  method  that  gives  Schmidt’s  Theorem  concerning 
||D||oo  when  =  2,  and  also  gives  the  lower  bound  ||i7||i  >  \/iog  N  when  k  —  2.  This 
is  best  possible,  since  Chen  [8]  has  shown  that  if  0  <  p  <  00  and  k  is  given,  k  ^  2,  then 
there  exists  a  configuration  of  N  points  in  for  which  ||D||p  k  (log 
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Roth  obtained  his  lower  bound  by  the  Cauchy-Schwarz  inequality, 

f  D{a)F{a)da  ^  (  [  D(a)^da^  (  f  F(a)^da^ 

where  F  is  a  test  function  defined  to  be  a  sum,  F  —  J]  ,  of  loore  basic  orthogonal 
functions.  Haldsz  similarly  used  the  inequality 

[  D(a)F(a)da^||D||oo||F|h 

JT2 

where  F  ==  flr(f  •  This  bears  a  strong  resemblance  to  a  Riesz  product,  as  occurs  in 

the  theory  of  lacunary  trigonometric  series.  For  the  lower  bound  of  ||F||i ,  he  wrote 

[  F(a)F(a)da^||F|M|F|U 

where 


Since  — 1  ^  F^  ^  1 ,  it  follows  that  \F\  ^  (1  -f  1/F)^  <C  1 . 

Let  e{xi),e(x2), ...  be  an  infinite  sequence  of  unimodular  complex  numbers,  and  put 
Pn{z)  —  nn=i('^  ~  e(2;„)) .  Erdos  asked  whether  it  is  possible  to  choose  the  Xn  in  such  a 
way  that  the  numbers  max|^[;^i  \Pn{z)\  are  bounded  as  N  oo.  Note  that  ^  logF;v(e(rc)) 
is  just  the  discrepancy  of  the  first  N  points,  while  the  problem  now  being  considered  involves 
the  harmonic  conjugate  3RlogFyv(e(a:)) ,  but  with  the  important  difference  that  we  need  this 
quantity  to  be  large  and  positive,  not  just  large  in  absolute  value.  That  this  sequence  can  not 
remain  bounded  was  first  proved  by  Wagner  [35],  by  means  of  a  modified  form  of  Schmidt’s 
method.  Later  Beck  [3]  used  Halasz’s  modified  form  of  Roth’s  method  to  obtain  this  in  the 
following  sharp  quantitative  form:  There  is  an  absolute  constant  (^  >  0  such  that 

max  I  Fat  (z)  I  > 
kKi 

for  infinitely  many  N . 

Measuring  the  distribution  of  points  in  relative  to  rectangles  with  sides  parallel  to  the 
coordinate  axes  is  only  one  of  many  possibilities.  If  S  is  a  measurable  set  then  the  quantity 

D{B)  =  card{l  ^  n  ^  N  :  Un  ^  B}  —  N vo\S 

provides  a  measure  of  the  distribution  of  the  .  But  when  we  consider  S  we  would  also 
include  its  translates,  so  we  put  d{a)  —  D{S  +  a) .  Then  d(0)  -  0 ,  but  for  k  we  have 
d{k)  -  Xg{"f^)UN{k)  where  UN{k)  =  e{-k  •  tin)  •  Hence  by  Parseval’s  identity, 

•'T*  *j£0 
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The  rate  that  Xg  (^)  tends  to  0  as  A;  tends  to  infinity  in  a  particular  direction  depends  on 
how  much  of  the  boundary  of  S  is  orthogonal  to  the  direction  in  question.  Thus  in  the  case 
of  a  rectangle  the  Fourier  coefficients  decay  slowly  in  the  direction  of  the  coordinate  axes, 
but  comparatively  rapidly  in  other  directions.  Thus  one  may  expect  that  points  may  be  found 
so  that  C/iv(fc)  is  small  when  Xg  (k)  is  large,  and  vice  versa.  By  using  the  Fejer  kernel  as  in 
our  discussion  of  power  sums  we  can  show  that 

|{/jv(fc)P  ^  AfXA  -  JV^ 

\kiKXi 

1*2 1^X2 

for  any  positive  real  numbers  Xi,X2 .  In  this  way  we  can  recover  Roth’s  lower  bound  for 
\\D\\2  .  By  averaging  over  disks  (and  averaging  over  the  radius  as  well)  we  find  that  there 
is  a  disk  D  for  which  D(T>)  ^  .  More  generally,  if  we  start  with  a  set  8,  and  are 

allowed  to  shrink,  translate,  and  rotate  8,  then  we  obtain  this  larger  order  of  magnitude,  in 
view  of  a  general  principle  governing  the  mean  square  decay  of  the  Fourier  transform  of  the 
characteristic  function  of  a  set:  If  C  is  a  simple,  closed,  piecewise  curve  in  ,  and  8  is 
its  interior,  then 


as  00 .  This  is  due  to  Montgomery  [19],  [20]  pp.  1 14-1 19;  see  also  Herz  [15, 16]. 


5.  The  Large  Sieve 


The  large  sieve  was  originated  by  Linnik  [18]  in  a  somewhat  obscure  form.  It  gained  new  life 
in  the  hands  of  Renyi  [28],  who  viewed  it  as  a  statement  about  almost  independent  vectors. 
Today  we  usually  think  of  this  as  an  extension  of  Bessel’s  inequality  for  vectors  in  an  inner 
product  space:  Let  <j)i, . . .  ,(I)r  be  arbitrary  vectors  in  an  inner  product  space.  Then  the 
following  three  assertions  concerning  the  constant  C  are  equivalent: 

(a)  For  any  vector  ^  in  the  space. 


r=l 

(b)  For  any  complex  numbers  Ur  we  have 


^  ^  '^rUs{(l>ri  <f>s) 
l^r.s^R 


r=l 


(C)  C  =  p{[{,^r,4>s)]). 
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(Here  p{A)  denotes  the  spectral  radius  of  the  matrix  A .)  Renyi  took  the  coordinates  of  his 
vectors  to  depend  on  arithmetic  progressions  or  on  Dirichlet  characters,  but  the  vectors  he 
obtained  in  doing  so  were  not  very  close  to  orthogonal,  so  the  estimates  he  obtained  were 
imperfect.  Roth  [29]  had  the  excellent  idea  of  taking  —  (e(nar))  where  the  are  well¬ 
spaced  in  T .  These  vectors  are  quite  close  to  being  orthogonal,  and  we  now  think  of  the  most 
basic  form  of  the  large  sieve  as  being  a  statement  about  the  mean  square  of  a  trigonometric 
polynomial  at  well-spaced  points.  That  is,  we  take 

M+N 

n=M+l 


and  we  consider  inequalities  of  the  shape 

R  M+AT 

^  |a„p. 

r=l  n=M+l 

We  suppose  that  ||ar  —  ^  for  r  ^  s,  and  want  A  to  depend  on  N  and  6.  If 

a„  =  e(— ncKi)  then  5'(ai)  =  N  ^  and  thus  we  must  have  A  ^  N .  By  averaging  over 
translations  of  the  ar  we  can  also  show  that  A  ^  5“^  -  1 .  We  find  that  A  does  not  have  to 
be  much  larger  than  is  required  by  these  considerations. 

Gallagher  [10]  used  an  inequality  of  the  Sobolev  type, 

1  pa+5/2  1  pa+6/2 

l/(a)K  7  /  \fi^)\dx+-  \f'{x)\dx, 

«  Ja-5/2  ^  Ja-d/2 

to  show  that  one  can  take  A  =  1/5  -f  ttN  .  This  is  the  best  constant  with  respect  to  5, 
but  in  arithmetic  settings  the  coefficient  of  N  is  more  important.  The  main  advantage  of 
Gallagher’s  approach  is  that  it  generalizes  readily  to  other  families  of  functions.  To  obtain 
good  dependence  on  N  we  note  that  by  duality  the  stated  inequality  is  equivalent  to  the 
inequality 

M+N  R  2  ^ 

n=A/+l  r=l  r=l 

On  the  left  hand  side  we  square  out  and  take  the  sum  over  n  inside.  The  diagonal  terms  give 
Ix/rP »  so  it  remains  to  consider  the  non-diagonal  terms.  This  brings  us  to  Hilbert’s 
Inequality,  which  asserts  that 
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Montgomery  and  Vaughan  [23],  with  some  assistance  from  Selberg,  found  that  this  can  be 
generalized,  so  that 


E 


VrVs 


E 


rz/s 


provided  that  |Ar  As|  ^  6  whenever  r  ^  s.  Moreover  for  the  circle  group  we  have, 
correspondingly,  the  inequality 


E 

r,s 

r^s 


yrVs 

sin7r(ar  —  a^) 


where  jjar  -  a^H  ^  J  for  r  ^  s .  This  gives  the  large  sieve  with  the  factor  A  =  N  -\-  l/S . 
With  a  little  more  care  one  can  obtain  A  =  N  1/5  —  1  ^  and  with  this  constant  there  are 
situations  in  which  equality  can  occur. 

In  arithmetic  situations  the  are  usually  taken  to  be  the  Farey  fractions  a/q,m  which 
(a,  g)  —  1  and  q  ^  Q .  Since  \\a/q  —  a'/q'\\  ^  1/teO  ^  when  a/q  and  o! /q/  are 
distinct  modulo  1,  we  find  that 


Q  M+N 

EE  1*5' (u/^)  I  ^  ( V  +  Q  )  ^  |On|  • 

q=l  a=l  n=M+l 

(a,9)=l 


yvVs 


Ury 


The  generalized  Hilbert  Inequality  can  also  be  established  in  a  weighted  form,  in  which 
we  find  that 

r,s  '  T  ^ 

r^s 

where  |Ar  -  A^j  >  when  s  .  Here  the  constant  |7r  is  certainly  not  best  possible.  The 
above  also  has  a  counterpart  for  the  circle  group,  and  hence  we  have  a  weighted  form  of  the 
large  sieve, 

R  I  r*/  \  12  M+N 

£&«  E  l‘‘■^ 

r=l  2Sr  n=M+l 

Many  other  variants  of  the  large  sieve  have  been  derived,  involving,  for  example,  maxi¬ 
mal  partial  sums  (via  the  Carleson-Hunt  Theorem),  or  the  Hardy-Littlewood  Maximal  Theo¬ 
rem.  As  for  further  generalizations  of  Hilbert’s  Inequality,  Montgomery  and  Vaaler  [22]  have 
shown  that  if  pr  =  Pr  +  hr  with  /3r  ^  0  for  all  r  and  |7r  —  7^1  ^  <5^  for  s  ^  r ,  then 

lE^h^Eif  ■ 

r^s 
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The  proof  depends  on  the  theory  of  functions  that  are  analytic  in  a  half-plane. 


6.  Dirichlet  Series 

Let  D{s)  =  •  By  Hilbert’s  Inequality  we  see  that 


and  the  weighted  Hilbert  Inequality  gives 

r  \D{it)\^dt^ 
Jo 


{T  +  0(N))J2Kf, 

n=l 


y]|a„P(r  +  0{n)). 

n=l 


Thus  we  have  some  limitation  on  the  amount  of  time  that  \D\  is  large.  Suppose  that  ^  1 
for  all  n .  Then  by  applying  the  above  to  we  find  that 

\D{it)\Ut  <  (T  +  , 

It  would  be  very  useful  if  we  could  interpolate  between  these  estimates,  in  the  sense  that 

\D{it)  1^  di  <  (T  + 

for  real  q,  2  ^  q  ^  4.  Indeed,  this  implies  the  Density  Hypothesis  concerning  the  Riemann 
zeta  function.  Many  years  ago,  Hardy  and  Littlewood  had  conjectured  that  if  \f{k)\  ^  F{k) 
for  all  k  and  q  2  then  ||/|lg  II » <ind  it  can  be  shown  that  this  majorant  conjecture 
would  imply  the  conjecture  above.  However,  Bachelis  [1]  used  a  method  of  Katznelson  to 
show  that  this  is  true  only  when  q  is  an  even  integer  (in  which  case  it  holds  with  constant  1). 

In  any  case  there  is  more  that  can  be  said  about  the  number  of  times  a  Dirichlet  polynomial 
can  be  large  than  follows  from  moment  estimates.  For  example,  if  0  ^  <  ^2  <  •  ■  ■  <  ^ 

T  and  U+i  for  all  r ,  if  \D{itr)\  ^  V  for  all  r ,  and  if  |a„|  ^  1  for  all  n,  then 

it  is  known  that  R  provided  that  .  It  has  been  conjectured  that 

this  estimate  for  R  holds  when  the  last  condition  is  weakened  to  read  ^  NT^ .  However, 
Bourgain  [7]  has  shown  that  if  this  is  so  then  every  Kakeya  set  in  EJ  ,  d  ^  2 ,  has  Hausdorff 
dimension  d ,  Thus  there  are  those  that  doubt  such  a  strong  conjecture. 


7.  Spectral  Characteristics  of  Zeros  of  the  Zeta  Function 

Let  h{d)  denote  the  class  number  of  the  quadratic  number  field  with  discriminant  d .  In  an 
effort  to  derive  a  useful  lower  bound  for  h(d)  when  d  is  negative,  it  was  recognized  that 
it  would  suffice  to  have  a  good  supply  of  pairs  of  zeros  of  the  Riemann  zeta  function  that 
are  <  apart  where  c  <  1/2.  With  this  motivation,  an  attempt  was  made  in  1971  to 
determine  the  distribution  of  7  —  7'  as  7  and  7'  run  over  nearby  ordinates  of  zeros  of  the 
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zeta  function.  One  begins  with  a  generalization  of  the  explicit  formula  noted  earlier.  We 
observe  that 


c+ioo  /-/ 


C  /  ^ 

— (s  +  w) —  aw 

C  w 


=  -<<*>  +  r:7 


°°  ^—2n—s 


p  -  s  “  2n~^s 

p  n=l 


We  assume  the  Riemann  Hypothesis  and  combine  two  such  explicit  formulae  to  see  that 


n>x 


-\-X  +0{x^^yt)  . 


We  take  the  modulus  squared  of  both  sides,  and  integrate  over  0  <  f  <  T .  We  write  x  =  T^ 
for  0  ^  Q:  ^  1 ,  and  note  that  the  expression  on  the  right  hand  side  can  be  asymptotically 
evaluated  by  using  our  mean  square  estimate  for  Dirichlet  polynomials.  Let 

F(a)  =  (Liogr)-'  Y. 

0<7^T 

o<y^T 

where  w(u)  =  4/(4  +  u^) .  Then  we  find  that  F  is  real,  even,  non-negative,  and  F{a)  = 
(1  -I-  o(l))T“^“ logT  4-  a  -H  o(l)  as  T  ->  oo,  uniformly  for  0  ^  a  <  1 .  When  a  >  1 
the  method  fails  because  the  Dirichlet  polynomial  is  too  long.  But  only  the  terms  near  the 
diagonal  contribute,  and  one  can  use  the  Hardy-Littlewood  quantitative  form  of  the  Twin 
Prime  Conjecture  to  estimate  that  those  terms  contribute.  In  this  way  one  is  led  to  guess  that 
F(q:)  =  l-t-o(l)  uniformly  for  1  ^  o;  ^  A ,  for  any  A>  I .  This  is  known  as  the  Strong  Pair 
Correlation  Conjecture.  By  taking  Fourier  transforms,  we  are  led  to  a  conjecture  concerning 
the  distribution  of  the  frequencies  7  —  7' :  The  number  of  pairs  7, 7'  of  ordinates  of  zeros, 
0  <  7  <  T ,  0  <  7'  <  T ,  for  which  27va/  logT  ^  7  —  7'  <  2nl3/  logT  is  asymptotic  to 

/sin7r?zN2  \T.  _ 

Here  (5  =  1  if  0  E  [o;,  ^] ,  and  <5  =  0  otherwise.  That  is,  <5  is  a  Dirac  point  mass  at  0 . 
This  arises  because  of  the  possibility  that  7  =  7'  when  a  <  0  <  ^ .  This  is  the  Weak 
Pair  Correlation  Conjecture.  Freeman  Dyson  observed  that  the  density  function  here  is  that 
of  a  random  hermitian  matrix  of  unitary  type,  and  thus  we  take  the  above,  although  only  a 
conjecture,  as  evidence  that  the  zeros  of  the  zeta  function  are  spectral  in  nature. 
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Goldston  and  Montgomery  [12]  showed  that  if  the  Riemann  Hypothesis  is  true  then  the 
Strong  Pair  Correlation  Conjecture  is  equivalent  to  the  estimate 


+  /i)  —  il){x)  ~  hY  dx  ~  hX  \ogX/h 

for  x^  ^  h  ^  x^  ^ .  A  heuristic  argument  in  favor  of  this  can  be  obtained  by  expanding  out 
and  using  the  Hardy-Littlewood  Conjecture  concerning  the  number  of  d  -twin  primes  not 
exceeding  x . 

Recently  Montgomery  and  Soundararajan  [26]  considered  the  higher  moments 

1 

Hk{X,h)  =  —j  (rl>(x  +  h)  -  ip(x)  -  h)'‘ dx . 

On  expanding,  one  encounters  enumerations  of  prime  -tuples.  The  Hardy-Littlewood  Prime 
k -tuple  Conjecture  asserts  that  if  di,  ^2?  •  •  •  distinct  integers  then 

k 

EH  A(n  +  di)  -  e('D)X-{-E{X,Ti) 

n^X  i=l 


where  ©(D)  is  the ‘singular  series’ 


6(D) 


TT 

l^qiKoo  i=l 


E  < 

{ai,qi)=l 

X)at/9i€Z 


(Et)- 


Gallagher  [11]  considered  the  moments  /ijt(X,  h)  for  smaller  h  of  the  form  h  =  clog X , 
and  for  that  purpose  showed  that 

E 

O^di^X 
di  distinct 


as  X  ^  oo .  For  larger  h ,  the  mean  value  h  is  much  larger  than  the  usual  size  of  the 
difference  'tl)(x  h)  -  '^(x)  -  h,  and  so  it  is  useful  to  consider  the  arithmetic  function 
Ao(n)  =  A(n)  —  1 ,  whose  mean  value  is  asymptotically  zero.  For  this  function  we  have  an 
alternative  formulation  of  the  prime  /c-tuple  Conjecture,  which  asserts  that 

k 

En  Ao(n  +  di)  =  eomx  +  E^{X,D) 

ni.X  i=l 
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where  now 


6„(D) 


=  E  E  KEf)^ 


\<qi<oo 


(ai,9i)=l 


Thus  6o  is  the  same  as  ©  except  that  the  possibility  that  =  1  is  now  excluded.  The  mean 
value  of  ©0  is  of  course  smaller,  and  hence  more  difficult  to  determine,  but  by  elaborating 
on  work  of  Montgomery  and  Vaughan  [24]  concerning  the  distribution  of  reduced  residues 
modulo  q  in  short  intervals  it  can  be  shown  that 


\  if  fc  is  odd. 

di  distinct 

This  is  established  unconditionally;  when  combined  with  plausible  hypotheses  concerning 
the  size  and  behavior  of  the  error  terms  E{X^  h)  we  are  led  to  expect  that 
MX,h)  =  {c,+oil))Xh'‘/‘^{logX/hf/^ 


where  the  c*  are  the  moments  of  the  normalized  normal  variable. 


Ck 


k\ 

(A:/2)!2V2 

0 


if  k  is  even, 
if  k  is  odd. 


Since  these  moments  occur  uniquely  in  the  case  of  normal  distribution,  we  are  led  to  expect 
that  the  distribution  of  ip{x  h)  -  'ip(x) ,  for  0  ^  x  ^  X ,  is  approximately  normal  with 
mean  h  and  variance  h\ogX/h. 

Suppose  that  X  >  T .  In  terms  of  zeros  of  the  zeta  function,  the  Strong  Pair  Correlation 
Conjecture  seems  to  be  telling  us  that  the  mean  square  size  of  the  sum 


^  cos7loga;  (0  ^  a;  <  X) 

0<7^T 


is  the  same  as  if  the  terms  were  uncorrelated  random  variables.  (We  recall  from  the  theory 
of  probability  that  if  Xi  are  uncorrelated  variables  then  Var(X)Xi)  =  5])Var(Xi).)  It 
seems  that  our  new  speculations  concerning  the  jj-k  can  similarly  be  interpreted  as  asserting 
that  the  above  sum  is  distributed  as  if  the  terms  are  independent  random  variables  (as  in  the 
Central  Limit  Theorem).  How  this  relates  to  the  spectral  nature  of  the  zeros,  or  any  possible 
underlying  operators  remains  to  be  seen. 
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Mathematics  of  Radar 


Bill  Moran 


1.  Radar  Fundamentals 

1.1.  Introduction 

Radar  is  now  used  in  many  applications  —  meteorology,  mapping,  air  traffic  control,  ship 
and  aircraft  navigation,  altimeters  on  aircraft,  police  speeding  control,  etc.  It  is  now  being 
used  in  the  form  of  ground  penetrating  radar  for  mineral  exploration  and  delineation  and  for 
land-mine  detection.  Of  course,  its  primary  role  is  in  defence.  The  theory  of  radar  is  well 
developed  and  has  many  interesting  and  difticult  mathematical  problems. 

The  aim  of  these  notes  is  to  provide  a  description  of  radar  and  its  theory  accessible  to  a 
mathematical  audience  with  the  hope  of  stimulating  interest  in  the  problems.  Often  mathe¬ 
matical  treatments  of  radar  ignore  the  problems  of  implementation.  I  intend  to  go  as  far  into 
the  engneering  of  a  radar  system  as  is  necesssary  to  provide  an  understanding  of  these  issues. 
In  the  first  section  we  shall  describe  the  operation  of  a  radar  system  at  a  relatively  detailed 
level.  Later  sections  will  cover  some  of  the  mathematics  arising  in  radar  design  and  use. 

1.2.  A  Simple  Radar  System 

The  key  references  for  this  section  are  [3, 14, 16].  Typically  radar  systems  comprise  a 
transmitter  and  receiver,  though  recently  passive  radar  systems  relying  on  the  ambient  HF  and 
VHF  radiation  have  been  built  and  made  operational.  We  shall  not  consider  such  systems.  We 
shall  also  assume  that  the  radar  transmitter  and  receiver  are  collocated  {monostatic),  again  the 
usual  situation.  Nonetheless  there  is  much  current  interest  in  bistatic  and  multistatic  radars 
The  most  important  issues  in  designing  a  radar  system  are  as  follows. 

1.  What  is  its  purpose?  How  it  will  be  used  significantly  changes  the  method  of  process¬ 
ing  the  data.  Some  radars  are  used  to  detect  targets,  others  to  track  them,  others  to 
produce  images.  Radars  used  to  check  the  speeds  of  vehicles,  for  example,  need  to  be 
able  to  measure  the  doppler  accurately  in  a  small  range  of  velocities. 

2.  The  next  most  serious  issue  is  that  of  noise.  Returns  from  relatively  distant  targets 
have  very  small  amounts  of  energy  which  can  be  swamped  by  the  noise  generated 
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within  the  receiver.  Methods  of  maximizing  the  post  processing  signal-to-noise  ratio 
are  crucially  important  in  radar  systems. 

3.  As  well  as  noise,  a  radar  system  will  receive  returns  from  objects  which  are  not  impor¬ 
tant  for  the  operational  purpose  of  the  radar.  For  example,  shipbom  radars,  typically 
receive  much  of  the  return  energy  from  the  sea  surface,  particularly  if  it  is  rough.  This 
can  mask  a  more  important  return.  Such  artefacts  are  collectively  called  “clutter”. 
Much  work  has  been  done  on  extraction  of  signals  of  importance  from  the  clutter. 

4.  The  power  available  to  the  transmitter  is  a  key  factor,  particularly  in  reducing  noise 
problems.  Evidently  this  dictates  much  about  the  size  of  the  transmitter  and  the  type 
of  electronics  to  be  used, 

5.  The  resolution  needed  in  both  doppler  and  range  and  the  range  of  values  of  these 
parameters  for  the  targets  of  importance  are  significant  factors  in  the  design  of  a  radar. 

6.  Of  increasing  importance  in  a  defence  environment  are  issues  around  jamming  and 
other  electronic  counter-measures  to  obscure  or  mask  a  target,  or  otherwise  render  a 
radar  system  ineffective.  Radar  system  developers  have  to  be  ready  to  counter  such 
interventions. 

simple  radar  system  is  sketched  in  Figure  1. 
t  a  general  level,  the  various  parts  of  a  radar  system  are: 

1.  The  antenna  —  usually  serving  the  dual  role  of  transmitting  and  receiving  signals. 
The  shape  of  the  antenna  dictates  the  shape  of  the  beam  that  is  transmitted.  Focussing 
allows  radar  systems  to  put  power  where  it  is  most  needed.  Inevitably  all  beams  have 
some  sidelobes.  Shaping  of  these  is  an  interesting  mathematical  problem  which  we  do 
not  have  time  to  cover  here.  Many  modem  radars  use  phased  array  antennas  which 
comprise  many  small  antennas  each  of  which  can  be  assigned  a  separate  (complex) 
weighting.  Such  a  system  allows  the  radar  beam  to  be  steered  electronically  rather 
than  mechanically  as  is  the  case  for  conventional  parabolic  dish  antennas. 

2.  The  duplexer  —  this  switches  the  antenna  between  transmission  and  reception.  Both 
cannot  happen  at  the  same  time  usually.  Typicallya  radar  will  transmit  for  no  more 
than  10%  of  time.  During  most  of  the  remainder  of  the  time  it  will  be  in  receive  mode. 

3.  The  radio  frequency  (RF)  oscillator  —  provides  the  carrier  signal  for  the  radar.  This 
will  usually  be  at  least  10  times  the  bandwidth  of  the  (intermediate  frequency  coded) 
waveform  it  carries. 

4.  The  RF  mixers  —  one  in  each  of  the  transmitter  and  receiver.  These  serve  to  put  the 
waveform  onto  the  carrier  signal  in  the  transmitter  and  to  remove  it  in  the  receiver  by 
mixing  it  with  the  carrier  and  then  passing  it  to  the  low  pass  filters. 

5.  Filters  and  amplifiers  —  these  amplify  the  various  signals  and  filter  out  unwanted 
frequencies.  Filtering  is  a  significant  operation  for  the  removal  of  noise.  Noise  is 
typically  “broadband”  in  that  its  spectrum  is  wide.  By  filtering  out  many  regions  of 
the  spectrum  such  noise  is  reduced.  Amplifiers  increase  the  power  in  the  signals  (and 
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Figure  1.  A  simple  radar  system 


the  noise  which  is  present)  to  make  them  less  susceptible  to  noise  in  the  later  parts  of 
the  receiver.  Filtering  also  serves  to  remove  high  frequency  components  after  mixing. 
6.  Components  corresponding  to  the  ones  in  3,  4,  5  for  the  intermediate  frequency  (IF) 
stages  of  the  system;  in  particular  the  demodulation  mixers  and  filters  which  produce 
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the  I  and  Q  channels.  These  are  obtained  by  mixing  with  two  IF  sinusoids  Tt  12  radians 
out  of  phase  with  each  other. 

Waveform  generator  —  produces  the  desired  waveform  for  modulation  onto  the  IF 
carrier  and  then  onto  the  RF  carrier  for  transmission.  The  waveform  is  also  used  to  “ 
matched  filter”  the  return.  We  shall  spend  some  time  discussing  waveform  design  in 
section  3. 

Signal  processing  block  —  this  serves  to  detect,  identify,  estimate  the  range  and/or 
doppler  of  targets,  to  remove  clutter  and  noise  not  removed  earlier  in  the  processing, 
etc.  As  analogue  to  digital  (A/D)  converters  become  faster  and  more  accurate,  signal 
processing  is  increasingly  accomplished  by  converting  the  signals  to  digital  form  and 
then  using  digital  circuitry  and  even  software.  In  fact  technology  already  at  the  (low) 
IF  stage  and  I/Q  channel  separation  can  be  accomplished  digitally. 

1.3.  Doppler  and  Range 

We  begin  here  to  build  the  mathematical  model  of  radar  processing,  by  and  large  follow¬ 
ing  Borden’s  treatment  [3].  Usually  a  transmitted  signal  comprises  a  slowly  varying  wave¬ 
form  superimposed  on  a  rapidly  oscillating  sinusoid.  Thus 

(1.1)  s(/)  =  w(t).  cos(27r(/c/  +  (l>{t)) 

where  w(i)  is  the  amplitude  modulation  waveform,  (f>{t)  represents  the  frequency  or  phase 
modulation  and  fc  is  the  carrier  frequency.  It  is  important  to  make  the  rather  obvious  obser¬ 
vation  at  this  stage  that  all  signals  transmitted  and  received  are  real-valued.  Of  course  we  can 
represent  the  signal  as  the  result  of  amplitude  modulating  two  sinusoids  with  opposite  phase 
(that  is,  differing  by  7r/2),  thus 

s(t)  =  w(t).  cos(27r0(f))cos(27r/ct)  —  w(f)  siii(27r0(t))  sin(27r/ct), 
which  in  complex  form  is 

(1.2)  s(t)  = 
provided  the  waveform  w(f)  is  real.  We  write 

(1.3)  So(t)  = 

for  the  complexification  of  the  slowly  varying  part. 

Much  of  the  theory  of  radar  processing  takes  place  in  the  complex  domain  as  we  shall  see 
later.  To  give  an  idea  of  the  relative  variability  of  the  two  components  of  (1.1),  the  carrier 
frequency  will  often  be  in  the  range  1-lOGHz  (=  1  —  10  x  10®  cycles  per  second).  The 
variability  of  the  waveform  of  course  depends  on  its  form.  Often  phase  coded  pulses  are 
used  which,  in  theory,  switch  instantaneously,  but  limitations  in  the  practical  implementation 
usually  restrict  the  highest  frequency  components  of  the  waveform  to  be  no  more  than  1/10 
of  the  carrier  frequency. 
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In  the  complex  domain  we  can  write  the  signal  as 

(1.4)  s(t)  =  =  Si(so(t).e*’^’^'=*) 

The  transmitted  signal  hits  a  target  whose  distance  from  the  (collocated)  transmitter 
and  receiver  is  R.  Let  us  assume  for  the  moment  that  the  target  is  stationary  relative  to  the 
radar  system  and  that  it  comprises  a  single  scattered  Then  the  return  signal  will,  to  a  good 
approximation  (see  Section  [deleted  -Ed.]  for  a  more  accurate  view  of  radar  scattering),  be  a 
delayed  version  of  the  original  signal  which  depends  on  the  range  R  of  the  target.  Specifically 
the  signal  voltage  at  the  antenna  of  the  receiver  is 

2J? 

(1.5)  s„(f)  =  As{t - ) 

c 

where  c  is  the  speed  of  light  and  A  represents  the  attentuation  of  the  signal  by  the  reflection. 
As  I  have  already  said,  this  is  an  approximation  to  the  true  situation,  but  a  good  one  for 
most  purposes.  In  fact,  even  if  the  target  is  totally  unchanging  the  transmitted  waveform 
really  excites  the  electrons  within  the  material  of  the  target  which  then  reradiate  a  modulated 
version  of  the  signal.  This  effect  is  slight,  but  its  use  is  being  developed  for  the  purposes  of 
target  identification.  More  importantly  the  phase  of  the  waveform  will  change  on  reflection 
from  the  target.  This  can  be  acconunodated  within  the  complex  form  of  the  signal  (that  is,  the 
complex  signal  whose  real  part  is  given  in  (1.2))  by  allowing  A  to  be  complex.  Otherwise  an 
extra  phase  shift  is  imposed  on  the  carrier. 

When  the  waveform  returns  to  the  receiver  some  noise  is  added  (“receiver  noise”)  which 
arises  from  the  thermal  activity  generated  within  the  components  of  the  receiver.  It  is  impor¬ 
tant  to  realise  here  that  the  fall-off  of  the  signal  strength  (ie  power)  as  a  function  of  distance 
of  the  target  is  as  the  fourth  power  of  the  range  R,  so  that  typically  the  received  signal  power 
is  very  small  (for  a  distant  target  10“^®  watts  is  not  unusual  —  there  is  a  saying  among  radar 
engineers  that  the  total  signal  energy  received  by  all  radars  ever  is  not  enough  to  turn  a  page) 
and  in  this  context  receiver  noise  can  be  significant.  To  a  good  approximation,  receiver  noise 
is  Gaussian.  That  is  we  can  write 

Sr(f)  =  Suit)  +  Nit), 

where  iV(f)  is  a  Gaussian  process,  for  the  signal  after  the  initial  stages  of  the  receiver. 

I  have  mentioned  the  issue  of  noise  several  times  and  its  importance  cannot  be  over¬ 
emphasised.  At  any  moment  it  will  often  be  the  most  significant  part  of  the  signal  in  the 
system  prior  to  the  matched  filter.  Extraction  of  signal  from  noise  is  a  major  issue.  In  radar 
literature  the  noise  process  is  assumed  typically  to  be  “white  Gaussian  noise”.  From  a  mathe¬ 
matical  perspective  there  are  problems  with  this  assumption.  Such  a  continuous  time  process 
cannot  exist  since  it  would  have  infinite  power.  To  be  more  mathematically  precise,  we  as¬ 
sume  that  the  power  spectral  density  of  the  noise  is  flat  across  the  interval  where 

B  represents  the  bandwidth  of  the  receiver;  that  is,  the  maximum  frequency  it  will  handle, 
as  would  be  the  case  if  the  signal  had  been  passed  through  a  (perfect)  lowpass  filter  prior  to 
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further  processing.  The  treatment  of  noise  in  this  situation  involves  interesting  ideas  in  the 
field  of  mathematical  statistics  —  specifically  estimation  theory  and  hypothesis  testing,  but 
we  do  not  have  time  to  address  them  here.  Accordingly  we  refer  the  reader  to  [13, 18]. 

Now  we  insert  the  possibility  that  the  target  is  moving.  This  has  the  effect  of  modifying 
the  transmitted  waveform  other  than  just  by  delay.  It  imposes  the  doppler  effect  on  it.  If  this 
is  done  correctly  (ie  relativistically)  it  results  in  a  “  time  dilation”  of  the  return  signal,  so  that, 
if  the  target  has  a  radial  velocity  v,  the  return  signal  Su(t)  becomes 

2R 

Su(t)  =  As{ott - ), 

c 


where 


a  = 


(i-g) 
(!  +  ?)■ 


When  V  is  much  smaller  than  c  this  is  approximated  by  a  =  (1  —  2vfc),  A  further 
approximation  is  possible  if,  as  is  usually  the  case,  the  signal  is  “  narrow  band”;  that  is,  if  its 
(Fourier)  spectrum  is  essentially  in  a  range  (/c  —  /c  +  <5)  where  <5  is  small  compared  to 
/c.  For  most  radar  applications  this  is  a  reasonable  assumption  since  the  signal  modulating 
the  carrier  will  have  relatively  low  bandwidth.  In  this  case,  the  return  signal  is  approximated 
by  shifting  the  frequency  of  the  return  from  a  stationary  target  at  the  same  range  by  fa  = 
{2v/c)fc>  This  is  best  written  in  terms  of  the  complex  signal 


s„(t)  =  3«(so(i  - 


This  equation  is  the  standard  one  used  in  most  radar  calculations. 


1.4.  I  and  Q  channels 

As  we  have  already  said,  processing  of  radar  signals  is  done  largely  in  the  complex  do¬ 
main.  Of  course  the  signal  transmitted  from  and  returned  to  the  radar  antenna  is  real.  How¬ 
ever  in  a  very  natural  way  as  we  have  seen  (1.2)  the  transmitted  signal  is  treated  as  the  real 
part  of  a  complex  signal  which  is  simpler  to  understand.  When  the  return  signal  is  received 
it  is  turned  into  a  complex  signal  —  that  is,  it  is  made  into  two  real  signals.  This  is  usually 
done  in  two  steps.  First  the  signal  is  “  demodulated”  to  one  whose  carrier  frequency  (the 
“intermediate  frequency”  or  IF)  is  much  lower  and  so  in  practical  terms  easier  to  work  with. 
Demodulation  is  done  by  mixing  (that  is,  multiplying)  the  return  signal  with  a  signal  (pure 
tone)  whose  frequency  differs  from  the  carrier  frequency  by  the  IF.  In  other  words  we  form 

Sm{t)  =  Sr{t)cOS2n{fc  +  flF)t 

where  fjp  is  the  IF.  The  resulting  signal  is  then  passed  through  a  low-pass  filter  which  re¬ 
moves  all  frequencies  greater  than  the  carrier  frequency  and  leaves  intact  signals  whose  spec¬ 
trum  lies  well  below  the  carrier  frequency.  Typically  for  an  S-band  radar  (ie  /c  ^  3  X  10® 
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GHz)  the  IF  will  be  around  lOOMHz  or  less  and  may  be  accomplished  by  two  stages  of  de¬ 
modulation.  Using  the  product  formula  for  sine  and  cosine  we  see  that  the  resulting  signal  is 
approximately 

SiF{t)  =  w(t).  cos  27r</)(r)  cos  27r((//|r  — /d)r)) 

— w(r)  sin  27r</)(r)  sin  27r(//F  —  /ci)T), 

where  r  =  t  —  It  needs  to  be  realised  that  when  I  say  “approximately”  the  degree  of 
approximation  in  this  game  is  very  large.  Because  of  the  need  to  detect  very  small  signals  the 
filters  used  will  often  attenuate  the  power  in  unwanted  frequencies  by  10“®. 

At  this  point  the  conventional  radar  system  will  mix  the  signal  with  both  cos  27v  fjpt  and 
sin  27r/jf  t  and  again  low  pass  filter  to  form  the  I  and  Q  channels  respectively.  The  radar 
engineer  really  would  like  to  obtain  an  analytic  signal  so  the  correct  thing  to  do  is  to  form  the 
Hilbert  transform  of  the  real  signal  Sr{t).  This  is  impractical,  and  so  it  is  approximated  by 
this  method.  The  approximation  is  very  good  for  narrow  band  signals.  The  complex  signal 
which  results  from  these  transformations  is 


(1.6) 


Sc(t)  =  So(t - )e 

c 


—27rifdt 


1.5.  Ambiguity 


Let  us  return  to  the  case  when  there  is  no  doppler  component;  that  is,  when  the  target  is 
stationary.  The  return  signal  is  as  in  (1.5)  together  with  some  noise.  What  happens  next  in 
the  radar  receiver  depends  on  the  information  we  want  to  extract  from  the  signal.  However  at 
this  point  there  are  just  two  types  of  information  we  might  want  to  obtain: 

1.  detection  of  the  target  —  that  is  to  decide  on  the  basis  of  a  statistical  hypothesis  test 
whether  we  indeed  have  a  signal  or  just  noise; 

2.  estimation  of  the  range  R  of  the  target. 

It  turns  out  that  the  optimal  detector  (in  the  sense  of  the  Neymann-Pearson  Lemma)  and  the 
maximum  likelihood  range  estimator  both  require  the  same  operation  —  to  take  the  correla¬ 
tion  of  the  received  signal  with  a  copy  of  the  transmitted  signal.  This  also  has  the  effect  of 
maximizing  the  post-processing  signal-to-noise  ratio.  In  fact  it  makes  sense  to  do  this  in  the 
complex  domain.  Thus  we  form  the  function 

Q{x)  =  J  Sc(t)so{t  —  x)  dt 

where  the  bar  over  the  signal  indicates  (for  when  we  treat  complex  signals)  complex  conju¬ 
gation.  The  maximum  absolute  value  of  this  signal  tells  us  the  best  estimate  of  the  range  in 
Gaussian  noise  for  this  choice  of  transmitted  signal.  Note  that  we  will  usually  have  some 
control  over  the  shape  of  the  latter  and  we  shall  spend  some  time  later  on  issues  concerned 
with  its  design. 
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When  there  is  doppler  present  the  radar  behaves  in  a  similar  way.  The  return  signal  is 
correlated  (“  matched  filtered”)  with  the  transmitted  signal,  so  that  the  resulting  signal  looks 
like  this: 


(1.7)  /  Sc{t)so{t  -  dt 

where  now  the  return  signal  Sc(t)  is  as  in  (1.6).  The  effect  of  the  matched  filter  is  to  maximise 
the  signal  power  relative  to  the  noise  power  in  the  post-processing  signal. 

We  call  this  the  radar  ambiguity  function  of  the  signal  Sq  and  write  it  as  Xso(®5  /)•  ^ 
is  the  output  we  would  obtain  from  our  receiver  for  a  signal  delay  x  =  2R/c  and  velocity 
V  =  f,cf2fc.  An  example  for  a  rectangular  pulse  is  given  in  Figure  2.  We  shall  deal  with 


Figure  2.  Ambiguity  of  rectangular  waveform 


the  ambiguity  function  in  Section  2.  We  shall  return  to  the  choice  of  waveforms  in  Section  3. 


1.6.  Extended  targets 


When  the  transmitted  signal  is  reflected  from  an  extended  scatterer  or  from  multiple  scat- 
terers  which  are  not  time  varying,  the  resulting  signal  is  obtained  as  a  convolution  of  the 
so-called  reflectivity  kernel  of  the  target(s)  with  the  transmitted  signal.  Thus  if  k{t)  is  this 
kernel,  where  now  t  is  a  measure  of  range  (which  is  the  same  as  time  in  this  context),  the 
reflected  signal  is 


iu(t)  =  f  s(t 
./ R 


r)fe(r)  dr. 


Normally  velocity  does  not  vary  from  one  part  of  the  target  to  another  so  that  this  subjected  to 
a  constant  phase  shift,  though  it  is  straightforward  to  derive  the  formula  for  varying  doppler. 
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2.  Ambiguity  Functions 

This  section  discusses  the  mathematical  structures  lying  behind  the  ambiguity  function 
(particularly  the  narrow  band  case)  as  we  have  described  it  in  Section  1.5.  We  shall  do  so  in 
terms  of  the  unitary  representation  theory  of  the  Heisenberg  group  and  we  shall  deal  with  the 
theoretical  ideas  needed  first. 

2.1.  Representations  of  the  Heisenberg  Group 

The  narrow  band  ambiguity  function  can  be  described  in  several  different  ways  at  varying 
levels  of  abstraction.  In  an  attempt  to  use  some  of  the  power  of  the  abstract  setting  without 
demanding  too  much  of  the  reader,  I  have  attempted  to  steer  a  middle  course  between  papers 
of  Schempp  [19-22]  and  the  “naive”  viewpoint  of  Wilcox  [4].  In  many  respects  the  point  of 
view  adoped  here  is  that  of  Auslander  and  Tolimieri  [12];  this  couches  the  theory  in  terms 
of  the  infinite  dimensional  irreducible  representations  of  the  Heisenberg  group.  As  these  are 
probably  the  most  sophisticated  ideas  discussed  here,  we  present  them  in  detail. 


2.1.1.  The  Heisenberg  Group.  The  three  dimensional  Heisenberg  group  is  the  group  of  3  X  3 
upper  triangular  matrices  with  I’s  on  the  diagonal,  thus: 

1  X  z 

(2.1)  u  =  0  1  /  . 

0  0  1 

We  denote  the  collection  of  all  such  matrices  with  real  entries  by  'H  and  note  that  they  form 
a  group  under  matrix  multiplication.  Observe  that  the  inverse  of  u  above  is 

1  —X  —z  +  xf 
=  0  1  -/  . 

0  0  1 

This  group  has  the  topological  structure  of  R®,  so  that  we  can  talk  about  continuity  of  func¬ 
tions  on  Lebesgue  measure  m  on  is  invariant  in  the  sense  that 

(2.2)  /(ux)  dm(x)  =  /(x)  dm(x), 

where  the  variable  x  represents  an  element  of  'H  and  u  is  an  arbitrary  element  of  'H.  Here  /  is 
any  suitable  function  for  which  the  integrals  make  sense  (such  as  continuous  complex- valued 
functions  with  compact  support).  In  fact,  for  this  group,  the  measure  is  both  left  invariant 
and  right  invariant,  so  that  equation  (2.2)  also  holds  when  u  multiplies  x  on  the  right  in  the 
argument  of  /. 
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2.1.2.  Unitary  Representations.  A  unitary  representation  is  a  map  x  17x  from  H 
into  the  group  of  unitary  operators  U{S^)  of  some  (separable)  Hilbert  space  Sy,  which  is  a 
homomorphism  in  the  sense  that 

where  the .  on  the  left  side  indicates  multiplication  of  operators  and  on  the  right  multiplication 
of  group  elements  in  71.  We  shall  also  assume  that  this  map  is  continuous  into  the  weak- 
operator  topology  on  W(^);  that  is, 

X  ($,  l/xT?) 

is  continuous  for  every  $  and  r;  in  the  Hilbert  space. 

There  are  some  simple  such  representations  of  71  for  which  is  one  dimensional  (and 
so  its  unitary  group  can  be  identified  with  the  group  of  complex  numbers  of  absolute  value  1 
under  multiplication).  An  example  is  the  trivial  representation  1  where 

lx  =  l 

for  all  X  €  H.  A  slightly  less  trivial  example  is  of  the  form 

(1  X  z\ 

0  1  /  M- 

0  0  ij 

where  a  and  0  are  any  real  numbers.  This  gives  a  two-dimensional  family,  parametrized  by 
a  and  0,  of  one-dimensional  representations,  but  these  are  not  important  for  our  purposes. 


2.1.3.  Irreducibility  and  Equivalence.  We  need  to  make  two  more  general  definitions  from 
representation  theory.  First  we  need  to  say  when  two  representations,  say  x  on  Sji, 

and  X  on  .^2 ,  are  equivalent.  By  this  we  mean  that  they  are  the  same  up  to  a  change 

of  Hilbert  basis,  or  equivalently,  that  there  is  a  unitary  operator  V  from  .^1  onto  such  that 


for  all  X  G  'W. 

A  representation  of  (x  Ux  on  say)  is  irreducible  if  it  cannot  be  broken  up  into 
sub-representations,  that  is,  if  there  are  not  two  subspaces  and  ^2  each  invariant  under 
all  of  the  operators  t/x  (x  G  iT),  such  that  0  iD2  in  the  sense  of  Hilbert  spaces. 

The  restriction  of  U  to  these  subspaces  produces  representations  and  on  and 
^2  respectively,  such  that,  for  all  x  G  H, 

where  ij  G  ^3  {j  =  1?  2).  While  it  may  not  be  obvious,  this  is  equivalent  to  the  non¬ 
existence  of  a  non-trivial  subspace  of  which  is  left  invariant  by  each  of  the  operators 
t/x?  (x  G  H).  This  is  a  consequence  of  the  unitary  property  of  the  operators.  Equally  it 
is  equivalent  by  Schur’s  Lemma  to  the  non-existence  of  a  non-trivial  bounded  operator  on  9) 
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which  comminutes  with  all  of  the  C/x-  Evidently  all  of  the  one-dimensional  representations 
described  in  (2.3)  are  irreducible. 

2. 1 .4.  The  Schrodinger  Representation,  Now  we  define  a  representation  of  on  the  Hilbert 
space  L^{R)  of  complex  square  integrable  functions  on  R  (or  equivalently  complex-valued 
finite  energy  signals  on  R)  by  letting  the  matrix  (2.3)  act  on  a  signal  5  as  a  point  target  with 
doppler  shift  f  and  delay  x  would.  Specifically  we  write 

(2.4)  U2{8){t)  =  -  x). 

Note  that  the  variable  z  acts  as  a  scalar  multiplier  (namely  the  scalar  which  makes 

the  homomorphism  property  work  out  correctly.  Here  7  is  a  non-zero  real  number.  Now  the 
Stone-von  Neumann  theorem  says  that,  up  to  equivalence  of  representations,  the  representa¬ 
tions  of  the  form  are  the  only  irreducible  representations  which  are  not  one  dimensional, 
so  that  (2.3)  and  (2.4)  are  descriptions  of  all  of  the  irreducible  unitary  representations  of  the 
Heisenberg  group  up  to  equivalence.  The  particular  forms  of  the  infinite  dimensional  irre¬ 
ducible  representations  of  the  Heisenberg  group  given  in  (2.4)  are  called  the  Schrodinger 
representation. 

In  fact  the  representation  in  (2.4)  is  characterized  via  the  Stone-von  Neumann  theorem 
as  the  unique  (up  to  equivalence)  irreducible  representation  whose  restriction  to  the  centre  of 
V.  (that  is,  to  the  matrices  as  in  (2.1)  for  which  /  =  x  =  0)  is  just  where  I 

is  the  identity  operator  on  L^{R). 

2,2.  Ambiguity  Functions 

It  is  now  easy  to  define  the  ambiguity  function  corresponding  to  a  signal  s  in  terms  of  the 
infinite  dimensional  irreducibles  of  the  Heisenberg  group.  Writing  U  =  we  define 

(2.5)  Xs(«,/)  =  (s,C7xS) 
where 

/I  /  0\ 

(2.6)  X  =  I  0  1  X  . 

Vo  0  1) 

This  produces  the  function  we  have  defined  in  (1.7).  This  view  of  ambiguity  functions  can 
be  found  in  the  paper  [12]  of  Auslander  and  Tolimieri.  The  choice  of  element  of  the  group 
may  seem  arbitrary,  but  remember  that  the  only  effect  of  changing  the  top  right  hand  comer 
of  this  matrix  is  to  multiply  the  ambiguity  function  by  a  scalar  of  absolute  value  1.  In  fact,  it 
is  really  the  absolute  value  of  the  ambiguity  function  which  is  important,  since  it  represents 
the  response  of  the  system  to  a  target  of  a  given  range  and  frequency,  and  this  is  unchanged 
by  a  change  in  the  top  right  hand  comer.  In  effect  we  are  making  a  choice  of  cross  section  of 
the  quotient  of  'H  by  its  centre  Z. 


306 


B.  Moran  /  Mathematics  of  Radar 


Once  we  have  made  this  definition,  it  makes  sense  to  allow  the  cross  ambiguity  function 

Xs,s'(^i  f)  —  ), 

where  x  is  as  in  (2.6). 

2.2.1.  Properties.  The  (cross-)ambiguity  function  has  a  number  of  remarkable  properties, 
one  of  the  most  important  of  which  is  Moyal’s  identity: 

Theorem  1.  Let  s  and  s'  be  in  L^(R).  Then 

=  Iklllk'll- 

This  is  a  simple  consequence  of  the  following  theorem: 

Theorem  2.  Let  si,S2,S3,S4  be  in  L^{R).  Then 

One  of  the  important  consequences  of  Moyal’s  identity  is  that  the  unitary  representation 
X  t/x  is  a  square  integrable  representation  modulo  the  centre  of  IL.  Such  representations 
have  been  widely  studied. 

Another  remarkable  property  of  this  quadratic  form  on  is  its  variation  under  the 

Fourier  transform  operator. 

Theorem  S.LetsE  L^{R)>  Thenjorany  (x,/)  G  R^, 

(2.7)  X^(5)(~/,x)  =  e2-^"X5(®,/). 

In  fact  this  arises  from  an  interesting  intertwining  property  of  the  Schrodinger  represen¬ 
tation.  The  map 

(x,f,z)  ^  {f,-x,-z) 

is  an  anti-automorphism  of  the  Heisenberg  group  which  we  call  k  and  so  x  t— >>  k{x)~^  is  an 
automorphism.  Call  this  u.  Then 

i/{x,  /,  z)  =  (-/,  x,z~-  fx). 

Moreover 

:F(t7x(s))  -  17,(x)(:f(s)), 

so  that  the  Fourier  transform  operator  intertwines  the  two  representations  U  and  D  where 

(2.8)  ~ 

Now,  for  s  G  i^(R), 

e""'"xs(®,/)  =  (s,t/xs)  =  (:F(s),:F(C7xS)), 
by  the  Plancherel  Theorem,  and  in  view  of  (2.8)  this  equals 
(J^(s),r>,:F(s)>  =  (J^(s),t/,w:F(s))  = 
which  proves  equation  (2.7).  A  corresponding  equality  holds  for  the  bilinear  form. 
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2.3.  Weil-Brezin  Formula  (Zak  Transform) 

There  is  another  way  of  realising  the  unitary  representations  we  have  described  and  used 
above.  It  will  simplify  formulae  if  we  deal  only  with  the  case  7=1,  though  of  course 
the  theory  extends  in  an  obvious  way  to  all  other  non-zero  values  of  7.  Instead  of  using  the 
Hilbert  space  L^{R)  we  consider  a  Hilbert  space  ^  of  functions  F  on  the  Heisenberg  group 
itself.  Let  T  be  the  subgroup  of  Ti  consisting  of  matrices  of  the  form 

(1  n  z 
0  1m 
0  0  1 

where  n,  m  G  Z  and  z  G  R.  Consider  the  space  ^  of  functions  F  on  FL  which  satisfy 
1.  F(7x)  =  for  all  7  G  F  as  in  (2.9); 

2-  fn/r  <  00, 

with  the  inner  product 

(2.10)  {F,G)^=  f  F(x)GWdx. 

Jn/r 

Next  we  define  a  representation  V  of  FL  just  by  the  simple  formula 
(2.11)  V^(F){y)  =  F(yx). 

We  need  to  make  a  few  remarks  about  these  formulae.  First  note  that  1.  makes  |F|  constant 
on  the  right  cosets  of  F,  so  that  the  integral  in  2.,  which  is  over  the  right  coset  space  Fo\iT, 
makes  sense.  This  applies  equally  to  the  integral  in  (2.10).  Secondly,  the  space  of  functions 
R  may  be  realised  as  the  space  of  sections  of  a  line  bundle  on  the  torus  T^.  Thirdly, 
those  familiar  with  induced  representations  will  recognize  that  this  is  just  the  representation 
T,  induced  from  the  subgroup  F  to  FL,  where  r  is  given  by 

In  z\ 

0  1  m  — 

001/ 

that  is,  V  =  indJ^T,  in  the  terminology  of  Mackey.  We  note  that  F  is  a  normal  subgroup 
of  91  and  so  we  may  apply  standard  Mackey  theory.  The  stabilizer  of  this  representation  is 
just  F  itself  and  so  the  induced  representation  is  irreducible.  Now  the  Stone-von  Neumann 
Theorem  tells  us  that  the  only  irreducible  representation  whose  restriction  to  the  centre  ofH, 
the  subgroup  of  matrices  with  x  and  y  both  zero,  is  just  times  the  identity  operator, 
is  the  one  we  have  called  W  earlier  with  7  =  1.  It  follows  that  the  representation  V  and 
U  =  are  equivalent,  so  there  is  a  unitary  operator  W  intertwining  U  and  V. 
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It  is  not  difficult  to  calculate  that  the  intertwining  operator  is  W  :  L^{R)  ^  ^  given  by 

(2.12)  W{s){f,x,  z)  =  Y. 

k^Z 

This  formula  is  called  the  Weil-Brezin  formula  or  (usually  without  the  term  the  Txik 

transform.  In  essence  it  is  a  map  between  functions  on  R  and  functions  on  the  unit  square 
(the  cross  sections  of  the  line  bundle)  and  maps  L^{R)  onto  L^{P)  where  I  represents  the 
unit  interval.  In  fact  the  map  is  unitary.  Since  it  intertwines  two  irreducible  representations  it 
has  to  be  unitary  up  to  a  scalar  multiple. 

This  gives  us  another  way  of  representing  ambiguity  functions,  since  for  s  6  Z/^(R), 

(2.13)  =  (5,C/x>  =  {W{8),V^W{s)). 

This  alternative  view  is  a  useful  device  in  realising  theoretical  features  of  x- 

2.4.  Resolution  and  Hermite  functions 

This  section  is  taken  largely  from  the  work  of  Wilcox  [4].  He  uses  a  slightly  modified 
form  of  the  ambiguity  function.  Indeed  in  many  papers  the  ambiguity  of  signal  s  is  defined 
in  a  symmetrised  form: 

(2.14)  As(a;,  f)  =  j^ 

This  formalism  corresponds  to  a  slightly  different  description  of  the  Heisenberg  group  as  the 
group  on  R^  whose  multiplication  is  given  by 

(2.15)  (xi,  /i,  zi)  *  (x2,  /2»  Z2)  =  (®i  4-  fi  +  /2»  ^1  +  2:2  +  B(xi,  /i;  X2,  /2)) 
where 

(2.16)  B(xi,  A;  X2,  A)  =  ^{xif2  -  /i®2)- 

It  is  important  to  be  aware  that  this  does  not  change  the  significant  properties  of  the  ambiguity 
function.  In  fact 

As(x,/)  =  e--^"Xs(®,/). 

The  Hermite  functions  play  a  central  role  in  the  theory  of  this  object.  For  our  purposes 
we  define  them  by  follows: 

91/4 

Wn{t)  =  (n  =  0, 1, 2,3,.. .) 

Vn! 

where  Hn  is  the  nth  Hermite  polynomial: 

Hn{t)  =  2’  3,  •  •  •)• 
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The  Hermite  functions  form  an  orthogonal  basis  of  X^(R).  Their  ambiguities  are  easily 
calculated  in  terms  of  known  functions: 

Aw„ix,y)  =  e“5(®’+»“)l,„(i(,r(a:^  +  y^)) 
where  Ln  is  the  nth  Laguerre  function 

Ln{x)  =  (x”e“®)  forx  >  0. 

n!  vd®/ 

Evidently  such  functions  are  radially  symmetric.  In  fact  this  characterizes  Hermite  wave¬ 
forms,  as  Wilcox  observes  in  [4]. 

Theorem  4.  An  ambiguity  function  Ag^x,  y)  is  radially  symmetric,  that  is, 

A,{x,y)  =  f{x^  +  y^) 
if  and  only  if  s  =  cWnfor  some  n. 


We  mention  here  too  the  many  papers  of  Schempp  on  this  subject  and  on  the  ambiguity 
function  in  general  (see,  for  example,  [19-22]). 

The  sharpness  of  the  peak  of  the  ambiguity  function  (at  (0, 0),  of  necessity)  is  a  measure 
of  the  waveform’s  capacity  to  resolve  nearby  (in  both  doppler  and  range)  targets.  One  way 
of  measuring  resolution  capabilities  is  as  follows.  Fix  a  minimum  response  resolution  e  and 
consider  the  level  curve 

=  1 -47r^e^- 

This  is  approximately  an  ellipse,  which  Wilcox  calls  the  resolution  ellipse.  Write  its  equation 
as 

-f  2'yxy  + 

In  fact 

»/r  «/r 

are  respectively  measures  of  the  bandwidth  and  the  time  duration  of  the  signal.  Two  close 
point  targets  will  produce  a  response  which  comprises  two  copies  of  the  ambiguity  function 
with  slightly  different  centres.  These  two  targets  will  be  most  difficult  to  resolve  if  their 
resolution  ellipses  share  a  common  major  axis.  The  two  targets  are  said  to  be  resolvable  if 
the  centres  of  their  resolution  ellipses  are  one  major  semi-axis  apart.  This  number  —  the 
smallest  resolvable  separation  —  is  given  by 


Wilcox  calls  R{€) /e  the  resolution  factor  p,  but  perhaps  for  mathematical  precision  the 
resolution  factor  should  be  the  derivative  of  R{e)  at  0.  Wilcox  has  shown  that  the  best 
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resolution  factor  obtainable  subject  to  constraints  ^  jSo  and  r  ^  tq  on  the  bandwidth  and 
time  duration  is 

1 

p  = - . 

mm{0o,  To) 

The  Hermite  functions  arise  in  this  context  as  follows.  Let  be  the  subspace  of 
consisting  of  all  functions  of  the  form  where  P  is  a  polynomial  of  degree  at  most 

n.  The  following  theorem  of  Hardy  characterizes  r2„. 

Theorem  S.  Lets  e  satisfy 

8{t)  =  |t|  ->  oo, 

Ts{f)  =  Oire-^f"),  I/I  ^  oo. 

Then  s  6 

Now  we  have  the  following  theorem  of  Wilcox. 

Theorem  6.  The  Hermite  waveform  Wn  has  the  smallest  resolution  factor  of  all  wave¬ 
forms  in  Qn- 

2.5.  Characterization 

We  discuss  here  the  problem  of  how  to  describe  the  class  *4,  of  ambiguity  functions  Xb  as 
3  ranges  over  all  functions  in  L^{R).  One  easy  characterization  arises  from  the  observation 
that  the  Fourier  transform  of  the  ambiguity  function  with  respect  to  the  doppler  variable  is 

■^2''‘(Xs)(a:,r)  =  s(r)s(r  -  a:), 

where  we  use  the  notation  ^2  to  mean  the  Fourier  transform  with  respect  to  the  second 
variable.  Making  the  change  of  variable 

T  —  X  =  U,  T  =  T, 

we  obtain  the  function 

H{u,t)  =  s(r)s(w). 

We  write  H  =  72.(xs)  to  denote  this  transformation;  that  is, 

(2.17)  n(F){3,t)  =  :F--^(F)(t  -  s,  t). 

To  describe  ambiguity  functions  then  it  is  enough  to  describe  these  functions  intrinsically 
and  this  is  easily  accomplished.  The  following  is  a  theorem  of  Wilcox  [4],  but  see  also  [12]. 

Theorem  1.  Let  H  e  L^(R^)  satisfy 

1.  H{u^t)  =  H{t,u) 

2.  H{x,x)  ^  0 

3.  H{v,v)H(x,t)  =  H{x,v)H{v,t) 
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Then  there  is  a  signal  s  €  such  that  H^ixs)  —  H;  that  is, 

Xs{x,f)=T2{K){x,f) 
where  K(u^  r)  =  if(r  —  u,  r). 

A  somewhat  similar  theorem,  also  due  to  Wilcox,  characterizes  ambiguity  functions  in 
terms  of  their  expansions  in  tensor  product  bases  of  the  Hilbert  space 

Z2(R2)  =  L^(R)  (g)  l2(r). 

A  tensor  orthonormal  basis  of  X^(R^)  is  a  basis  V^rr»,n(®j  v)  =  </>m(®)0n(2/)  where  (<^n) 
is  an  orthonormal  basis  of  X^(R).  Any  member  F  of  L^(R^)  has  an  expansion 

(2.18)  F(.X,y)  =  'Y^Cm,n4>m{x.)(t>n{.y) 

m,n 

which  converges  in  L^(R^).  The  following  simple  theorem  characterizes  ambiguity  func¬ 
tions  in  terms  of  their  expansion  in  such  a  basis. 

Theorem  8.  A  function  F  G  L^(R^)  is  an  ambiguity  function  if,  in  equation  (2,18), 

Cm,n  ~  ^nO>rn 

for  some  sequence  (a„). 

This  has  the  following  immediate  corollary 

Corollary  1.  A  function  F  G  L^{R^)  is  an  ambiguity  function  if,  in  equation  (2.18), 

Ck,kCm,n  =  Cm,kCk,n  and  Ck,k  >  Ofor  all  k,7n,n. 

Another  characterization  of  ambiguity  functions  arises  from  their  description  in  terms 
of  the  representation  of  the  Heisenberg  group.  To  this  end,  we  recall  that  a  complex-valued 
function  p  on  His  positive  definite  if,  for  any  choice  of  N  complex  numbers  Cn  and  elements 
of  the  group  x„, 

N 

52  >  0. 

m,n—l 

positive  definite  functions  arise  as  functions  of  the  form 

p(x)  =  (^,H(x)$), 

where  il  is  a  representation  of  the  group  and  $  and  is  an  element  of  the  Hilbert  space  on 
which  it  acts.  Among  the  positive  definite  functions,  the  ones  which  arise  from  irreducible 
representations  are  characterized  by  the  following  property:  we  call  a  positive  definite  func¬ 
tion  p  extremal  if,  for  all  other  positive  definite  functions  q  for  which  p  —  aq  is  also  positive 
definite  for  some  a  >  0  g  is  a  multiple  of  p. 
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Since  we  know  the  irreducible  representations  of  the  Heisenberg  group,  we  also  know 
all  extremal  positive  definite  functions.  An  ambiguity  function  is  now  a  function  /)  in 
(R)  for  which  the  function 

p{x,f,z)  = 

is  an  extremal  positive  definite  function.  While  this  characterization  is  not  as  easily  checked 
as  the  earlier  ones  it  is  of  theoretical  importance. 

2.6.  The  Abiguity  Problem 

The  simple  abiguity  problem  is  that  of  determining  a  signal  s  from  its  ambiguity  xs-  To 
make  it  mathematically  meaningful  we  restrict  the  signal  to  belong  to  so  that  for  such 

signals  we  are  interested  in  the  question:  is  s  one  to  one.  The  answer  to  this  question 

is  yes. 

It  is  relatively  simple  to  show  the  following  theorem  (see,  for  example,  [12]), 

Theorem  9.  Let  Si  and  S2  be  two  L^{R)  functions  for  which  Xsi  ~  Xs^  =  •^2- 

However  this  is  not  really  an  answer  to  the  radar  engineer’s  problem  for  several  reasons, 
one  of  which  is  that  the  ambiguity  function  is  not  what  is  important.  It  is  the  absolute  value 
of  the  ambiguity  function  which  is  really  of  interest.  In  this  context,  the  problem  arises  of 
determining  all  possible  waveforms  s  whose  ambiguity  functions  have  the  same  absolute 
value.  Much  of  this  section  is  adapted  from  work  of  Philippe  Jaming  indexJaming,  Philippe 
and  his  collaborators  (see  [7, 10, 15]).  Following  Bueckner  ( [9])  we  state  here  the  following: 

Definition  1  (General  Abiguity  Problem).  Given  si,S2  G  what  is  the  set  of  all 

pairs  5 j ,  s^ 

(2-19)  /)1 

for  all  {x,f)  €  R^? 

This  problem  is  unsolved  in  general,  but  the  following  simple  observation  shows  that 
equation  (2.19)  does  not  force  (si,  52)  =  s'f).  Let 

(2.20)  s^.  =  where  x  E  TL. 

Then  ,  with  x  =  («,  /,  0)  E  'H, 

=  |(Sl,l/*,t/xl7x-S2>|  =  |(si,t7,C;,S2)| 

where  z  =  x“^x'”^xx'.  Now  z  is  a  commutator  and  so  (since  the  Heisenberg  group  is  two 
step  nilpotent)  belongs  to  the  centre  of  71.  It  follows  from  the  irreducibility  of  U  that  is 
just  a  scalar  times  the  identity  operator,  say  and  hence  that 

Ksi,t/.t/zS2)|  =  |(Sx,{7.t7.S2>|  =  |(e="'"Si,{7,S2)|  =  |(s„i;xS2)|  =  |xs.,s,(x,/)|. 
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We  will  say  that  s^,  are  Heisenbery  related  to  Si  and  S2  if  a  relationship  holds  as  in 
(2.20).  Two  such  related  pairs  have  the  same  absolute  values  for  their  ambiguity  functions. 
The  problem  is  now  restated  as:  Does  (2.19)  force  Sj,  Sj  to  be  Heisenberg  related  to  si,  S2?. 
Unfortunately  this  is  not  the  case.  For  instance  if 


Si(t)  =  S2(t)  = 

hit)  =  s;(t)  = 


sin(27rnf) 

cos(27rnf) 


then 

IXSi,Sj(a:,/)|  = 

for  all  (x,  f)  6  R^,  though  si  is  not  Heisenbery  related  to  Sj.  The  proof  of  this  is  relatively 
easy  if  one  works  in  the  Fourier  domain  and  uses  the  identity  (2.7).  The  Fourier  transforms 
of  these  functions  comprise  two  non-zero  identical  “pieces”  around  =b2n7r;  the  difference 
between  the  two  being  in  the  signs  of  these  pieces.  The  ambiguity  therefore  is  non-zero  only 
when  translated  of  these  pieces  overlap.  It  is  now  clear  that  the  ambiguities  can  differ  only  is 
sign. 

For  special  classes  of  functions  it  is  the  case  that  all  s^,  s^  are  obtained  by  such  transfor¬ 
mations.  For  instance,  Bueckner  [9]  and  De  Buda  [5]  have  proved  that,  if  Si  and  S2  are  both 
of  the  form  P{t)  exp(— y)  with  P  a  polynomial,  then  s^  and  are  Heisenbery  related  to 
Si  and  S2. 

laming  indexJaming,  Philippe  [10]  has  obtained  some  results  on  the  abiguity  problem 
for  compactly  supported  functions.  His  results  are  stated  in  terms  of  the  symmetrised  form, 
though  obviously  simple  modifications  convert  to  the  x-form  of  the  ambiguity  function.  Let 
u  6  L^{R)  be  a  compactly  supported  function  and  suppose  v  satisfies  Then 

it  is  relatively  easy  to  see  that  v  is  also  compactly  supported.  Moreover  if  the  support  of  u 
is  contained  in  an  interval  of  length  2a  then  the  support  of  v  is  also  contained  in  an  interval 
of  length  2a.  In  fact  laming  shows  that  the  interval  of  support  must  have  the  same  length  for 
both  u  and  v. 

We  may  now  assume  that  supports  of  both  u  and  v  are  contained  in  [—a,  a]  and  no 
smaller  interval,  in  particular,  u  and  v  are  compactly  supported.  The  Paley- Wiener  theorem 
ensures  that  Auix^  y)  and  Ay{Xi  y)  are  both  entire  functions  of  exponential  type  in  the  y 
variable.  Now  (cf :  [10])  the  following  holds. 


(2.21)  A^{x,y)Ay{x,y)  =  Ay{x,y)Au{x,y)  for  all  a?  €  R,2/  €  C. 

On  the  other  hand,  by  the  Hadamard  factorisation  theorem,  an  entire  function  f{z)  of  ex¬ 
ponential  type  is  entirely  determined  by  its  zeros,  up  to  a  factor  with  A,  /r  6  C. 
Unfortunately  (2.21)  only  tells  us  that,  for  fixed  «,  if  2:  is  a  zero  of  Au{x, .)  then  either  z  or 
^  is  a  zero  of  At,(a;, .). 
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Several  cases  occur,  for  instance,  may  only  have  real  zeros  (e.g.  if  w  =  A[o,b])»  then 
Au  and  Aj,  have  the  same  zeroes. 

There  are  some  functions  u  for  which  every  ambiguity  partner  v  is  such  that  either  A^ 
and  Ay  have  the  same  zeroes,  or  A^  and  Azv  have  the  same  zeroes,  where  Zu{t)  = 
u{—t).  The  final  alternative  is  that  A{u)  and  A{v)  may  have  some  common  non-real 
zeroes  and  some  conjugate  zeroes. 

In  what  follows,  after  replacing  u  by  Zu  or  by  some  function  Heisenbery  related  to  u, 
we  shall  assume  that  A{u)  and  have  the  same  zeroes.  In  other  words  we  now  consider 
the  following  restricted  radar  abiguity  problem  : 

Problem  1  (Restricted  Radar  Abiguity  Problem).  Given  a  compactly  supported  u  G 
L^{R),  what  is  the  set  of  ambiguity  partners  v  ofu,  such  that  for  every  x  £  R,  A{u){xj .) 
and  A{v){x, .)  have  the  same  zeroes  in  the  complex  plane  ? 

laming  calls  such  ambiguity  partners  restricted  ambiguity  partners  and  shows  that  there 
exist  compactly  supported  functions  u  which  have  ambiguity  partners  that  are  not  restricted 
ambiguity  partners  either  of  u  or  of  Zu.  He  also  shows  the  following  result. 

Theorem  10.  Let  u  G  L^(R)  be  a  compactly  supported  function  and  let  v  be  a  re¬ 
stricted  ambiguity  partner  ofu.  If  VI  is  the  open  set  of  all  x  such  that  A{u){x, .)  is  not 
identically  0,  there  exists  a  locally  constant  function  <p  on  ft  such  that,  for  every  to,  ti,  t2 
belonging  to  the  support  ofu, 

4>(.t2  -  h)  +  <l>{ti  -  to)  =  -  to)  (27r) 

and 

v(x)  =  -  o) 

for  some  a£R^(v£R,  cET  and  some  Xq  belonging  to  the  support  ofu.  Conversely, 
every  function  v  of  that  form  is  a  (restricted)  ambiguity  partner  of  u. 

This  theorem  essentially  states  that  if  u  is  “simple”  (in  particular,  the  support  is  an  in¬ 
terval)  then  the  solutions  of  the  abiguity  problem  are  “simple”,  whereas  for  complicated  u 
(for  example,  when  the  support  has  big  gaps)  the  solutions  are  also  complicated.  In  [7],  the 
discrete  abiguity  problem  is  considered  and  various  interesting  results  are  obtained. 

2.7.  Approximation 

An  important  problem  for  the  radar  engineer  is  to  invent  waveforms  with  specific  ambi¬ 
guity  properties.  The  abiguity  problem  of  the  previous  section  is  one  part  of  that  problem  — 
it  examines  the  possibility  of  uniquely  specifying  a  waveform  in  terms  of  its  ambiguity  (or 
its  absolute  value),  and  have  we  have  seen  without  further  constraints  this  is  not  successful. 
Nonetheless  the  radar  engineer  needs  despite  the  non-uniqueness  to  be  able  to  find  wave¬ 
forms  with  appropriate  ambiguity  fucntions.  One  way  to  do  this  is  first  to  invent  a  function 
F  G  X^(R^)  with  the  appropriate  properties  and  then  find  the  ambiguity  function  which 
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most  closely  approximates  it.  In  other  words,  the  problem  is  to  find  the  signal  s  G  i^^(R) 
which  satisfies 

s  =  argmin||A,  - 

Using  the  unitary  operator  71  described  in  equation  (2.17),  we  obtain 

1 1  A.  -  J’lP  =  |k(t)I(?)  -  7t(F)|P  =  2(1  -  {niF),s®  s)) 

if  both  F  and  s  are  normalized.  The  problem  now  is  one  of  maximizing 


{71(F),  s(^  s)) 


which  corresponds  to  finding  the  eigenvector  whose  eigenvalue  has  the  largest  absolute  value 
of  the  integral  operator  with  kernel  7i,  that  is. 


7i(u)(t)  —  f  H(t,T)u(T)  dr, 
«/R. 


This  operator  is  compact  and  so  its  eigenvalues  /X2?  •  •  •  decrease  in  absolute  value  and 
tend  to  zero.  We  choose  for  our  signal  s  the  eigenvector  corresponding  to  /xi. 


2.8.  The  Wide-Band  Ambiguity  Function 


The  classical  ambiguity  function  of  Woodward  is  a  result  of  making  the  narrow  band 
approximation.  There  are  situations  where  this  is  not  valid.  For  example  in  sonar  work  where 
the  speed  of  sound  through  water  is  not  sufficiently  high  compared  to  velocities  of  targets  that 
the  approximation  is  valid.  Also  in  radar  the  approximation  breaks  down  in  two  ways: 

1.  the  velocity  is  a  substantial  proportion  of  the  velocity  of  light.  This  is  an  unlikely 
scenario  in  practice. 

2.  the  radar  is  “wide-band”;  that  is,  it  uses  a  broad  spectrum.  This  is  an  area  of  increasing 
interest. 


The  wide  band  cross  ambiguity  function  is  defined  by 


l^Si,S2(®,a)  =  [  si(f)s2(a(f  +  x))  dt, 

7R 


where  a  >  0  represents  the  scaling  due  to  doppler  and  x  corresponds  to  range  as  before; 
the  bar  over  the  second  term  in  the  integral  corresponds  to  complex  conjugation.  The  (auto)- 
ambiguity  function  of  a  signal  s  is  We(x,  a). 

This  formula  too  corresponds  to  a  representation  of  a  group.  This  time  the  group  in 
question  is  the  so  called  a®  +  6  group  G  of  2  x  2  matrices 


where  a  >  0  and  6  G  R.  The  representation  is  again  on  L^{B)  and  is  given  by 

Ua,b{s)(.t)  =  a*''^s(af  +  6) 
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for  /  €  Thewide-band  ambiguity  function  is  then  just 

^81,82(3^,0:)  =  (si,  t7a,x(S2))i;,2(R)- 

Several  properties  of  the  narrow  band  ambiguity  function  carry  over  to  the  wide  band  case. 
However  it  is  necessary  to  restrict  to  a  smaller  class  of  functions  to  make  the  theory  closely 
approximate  that  of  the  narrow  band  ambiguity.  We  write  if^(R)  for  the  Hardy  class  of 
functions  whose  Fourier  transforms  vanish  on  the  negative  half-line.  Note  that  this  subspace 
is  invariant  under  the  action  of  Ua,b  for  all  a  >  0  and  6  6  R.  The  original  representation 
therefore  cannot  have  been  irreducible,  though  the  restriction  to  ff  ^  (R)  is.  We  need  to  restrict 
further  to  the  those  functions  s  which  satisfy 

l^(«)(/)l^  y  <  «>• 

This  class  is  designated  and  we  write 

(si,  S2)  =  y  • 

It  is  now  possible  to  obtain  a  Moyal  type  identity.  For  u  in  i/^(R)  and  v  €  Wu,v  is  in 
^  =  £^(R*i*  X  R;  a~^  da  dh),  and 

)i:,3(R) 

There  are  more  aspects  of  the  narrow  band  theory  which  carry  over  to  the  wide  band  theory 
with  appropriate  modifications.  For  example  the  approximation  results  of  Section  2.7  have  a 
counterpart  in  the  wide  band  theory.  The  ax-\-b  group  is  an  example  (the  simplest)  of  a  non- 
unimodular  group  —  its  left  and  right  invariant  measures  are  different.  It  is  this  fact  which 
produces  the  need  for  the  two  separate  Hilbert  spaces  H^{R)  and  S^.  For  more  information 
on  this  issue,  the  reader  is  referred  to  [6].  For  more  on  the  general  theory  of  the  wide-band 
ambiguity,  we  refer  the  reader  to  [1,2, 11, 17,23].  laming  [10]  has  considered  the  ambiguity 
problem  for  wide-band  ambiguity. 

3,  Waveform  Design  and  Processing 

For  many  purposes  the  ideal  radar  waveform  would  produce  an  ambiguity  function  which 
was  the  so-called  “thumbtack”  —  that  is,  zero  everywhere  except  at  the  origin.  This  would 
have  ideal  range  and  doppler  discrimination.  However  Moyal’s  identity  tells  us  that  no  (finite 
energy)  signal  gives  rise  to  that  waveform  since  the  result  has  to  be  in  (R^)  and  have  norm 
equal  to  the  square  of  the  norm  of  the  signal.  A  very  short  pulse  would  seem  to  have  many 
advantages  in  this  sense.  It  has  a  good  thumbtack-like  ambiguity  at  least  in  the  range  direc¬ 
tion.  However  it  has  many  disadvantages.  If  it  is  not  to  have  extremely  high  power  (which  of 
course  means  that  the  electronics  have  to  be  able  to  deliver  this  amount  of  power  to  the  an¬ 
tenna)  then  the  total  electromagnetic  energy  hitting  the  target  is  very  small  and  the  resulting 
energy  returned  to  the  receiver  orders  of  magnitude  smaller.  The  received  energy  falls  off  as 
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the  fourth  power  of  the  distance  of  the  target  so  total  energy  on  target  is  a  significant  factor 
in  radar  detection,  particularly  of  distant  targets.  From  this  perspective  longer  waveforms  are 
better.  Another  disadvantage  of  a  short  pulse  is  that  it  is  a  very  wideband  signal,  and  so  not  so 
good  in  the  doppler  direction.  Moreover,  to  produce  the  shape  of  siuch  a  pulse  and  effectively 
deal  with  it  on  reception  requires  electronics  capable  of  handling  such  signals.  Maintaining 
linear  responses  over  such  a  frequency  spread  is  difficult.  Radar  engineers  then  aim  to  have 
long  waveforms  whose  auto-  correlation  ( matched  filter)  produces  something  approximating 
a  thumb-tack.  This  process  is  called  pulse  compression,  A  purely  random  signal  of  infinite 
length  (and  therefore  energy)  —  that  is  one  comprising  a  bi-infinite  sequence  of  Gaussian 
random  variables  of  zero  mean  and  constant  variance  has  an  expected  ambiguity  which  is 
a  thumbtack.  Figure  3  gives  a  the  ambiguity  of  a  finite  waveform  obtained  using  a  random 
number  generator.  As  a  result  pseudo-random  codes  (which  are  finite  and  deterministic  ap¬ 
proximations  to  random  signals)  are  often  used  as  waveforms  in  radar.  Many  papers  have 
been  written  examining  the  properties  of  different  waveforms.  We  give  a  small  sample  of 
these  here. 


10^  AmblouRy  of  rorttfafn 


o  2  «  lo 

Votootiy  tn  motroo  P4»r  oooonct  1 0* 


Figure  3.  Ambiguity  of  random  waveform 

Before  we  do,  however,  we  note  that  for  some  purposes  a  thumbtack  is  not  the  best  wave¬ 
form.  For  example  there  are  circumstances  in  which  a  waveform  which  is  extremely  doppler 
tolerant  (that  is  its  ambiguity  is  a  ridge  along  the  doppler  axis)  and  in  others  we  may  require 
range  tolerance. 

3.1.  Conventional  waveforms 

We  shall  first  talk  about  waveforms  for  a  convventional  radar. 

3.1.1.  Rectangular  and  Gaussian  pulses.  The  simplest  waveform  used  in  radar  is  just  a  sim¬ 
ple  pulse,  either  rectangular  or  Gaussian.  Such  pulses  are  shown  in  Figure  4  and  their  ambi¬ 
guity  functions  in  Figure  5.  In  fact,  of  course,  we  plot  the  absolute  value  of  the  ambiguity 
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Rectangular  pulse 


Gaussian  pulse 


Figure  4.  Rectangular  and  Gaussian  waveforms 


X  10^  Ambiguity  of  rectangular  pulaa  x  10“*  Ambiguity  of  Gaussian  pulse 


Figure  5.  Ambiguities  of  rectangular  and  Gaussian  waveforms 


since  in  general  it  is  a  complex  function.  Notice  the  “sine”  behaviour  on  the  doppler  direction 
of  the  rectangular  pulse.  It  has  a  relatively  sharp  peak  in  the  doppler  direction  but  then  has 
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“sidelobes”.  The  effect  of  these  sidelobes  is  to  mix  targets  of  differing  dopplers  into  the  same 
doppler  bin.  This  is  an  issue  for  shipbome  radars,  in  particular,  where  the  reflection  from  the 
sea  close  to  the  ship  might  obscure  a  distant  (and  therefore  fainter)  airplane  whose  doppler 
should  separate  it. 

A  glance  at  the  doppler  axes  in  Figure  5  shows  that  1  have  plotted  doppler  shifts  cor¬ 
responding  to  quite  unrealistic  velocities.  I  should  point  out  that  the  values  assume  a  100 
microsecond  pulse  (a  long  one)  at  10  GHz.  A  more  realistic  ambiguity  plot  is  shown  in  Fig¬ 
ure  6  —  it  gives  the  ambiguities  of  the  two  waveforms  for  velocities  up  to  mach  3  —  about 
1km  metres  per  second.  This  too  is  an  unrealistic  representation  since  the  grey  scales  do  not 
adequately  convey  the  significance  of  the  pixels  in  the  image.  Again  from  an  engineering 
viewpoint  it  is  better  to  plot  the  logarithm  of  the  values  of  the  ambiguity  —  to  be  precise 
we  need  a  dB  plot  where  the  value  in  decibels  is  equal  to  20  log^o  |xl-  These  are  shown  in 
Figure  7.  As  can  be  seen,  the  main  problems  with  Aese  waveforms  are 
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K  10'*  Ambiguity  of  Gaussian  pulse 
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Figure  6.  Ambiguities  of  rectangular  and  Gaussian  waveforms  out  to  mach  3 

•  significant  doppler  tolerance; 

•  very  low  range  resolution. 

To  reiterate,  a  long  pulse  is  used  here  and  better  range  resolution  could  be  obtained  with  a 
shorter  pulse.  However,  compared  to  many  other  waveforms  both  of  these  have  significant 
weaknesses. 

3.1.2.  Chirp  waveforms.  Perhaps  the  next  most  common  waveform  is  called  a  “chirp”  or 
“linear  chirp”.  It  is  a  signal  with  a  linear  increase  or  decrease  in  frequency  over  time.  Using 
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Figure  7.  dB  plots  of  ambiguities  of  rectangular  and  Gaussian  waveforms 


the  fact  that  we  have  effectively  real  and  complex  (or  I  and  Q  channels)  it  is  possible  to 
transmit  the  signal  (rather  than  just  its  real  part,  say).  The  real  part  is  shown  in 

Figure  8,  along  with  its  auto-  correlation  and  the  ambiguity  of  the  complex  signal  is  shown  in 
Figure  8.  It  will  be  seen  that  the  ambiguity  of  the  chirp  has  a  ridge  running  across  the  diagram 
which  has  quite  a  sharp  peak,  however,  it  does  have  a  slight  fall  from  left  to  right  across  the 
image.  This  gives  rise  to  an  ambiguity  between  range  and  doppler.  The  sharpness  of  the 
peak  is  a  very  useful  feature  of  the  chirp  It  is  used  in  many  applications  and  in  particular  in 
synthetic  aperture  radar.  As  we  have  said,  radar  engineers  describe  waveforms  with  auto¬ 
correlations  looking  like  that  of  the  chirp  as  pulse  compression  waveforms.  They  have  the 
property  that  they  behave  to  some  extent  (at  least  for  low  doppler  values)  like  a  very  sharp 
pulse  while  retaining  high  energy. 

3.1.3.  Barker  sequences.  The  fourth  favourite  waveform  is  based  on  Barker  codes.  Such  a 
code  is  a  sequence  of  ±ls  with  the  property  that  the  auto  correlation  takes  just  the  values 
0,  ±1  and  the  length  of  the  code.  For  example  the  code  [1  —  1111]  has  auto  correlation 

[101050101]. 

Barker  codes  have  almost  perfect  auto  correlation  properties  and  quite  doppler  intolerant 
ambiguity  properties.  The  ambiguity  is  shown  in  Figure  10.  Unfortunately  they  are  only 
known  to  exist  in  lengths  3, 5, 7, 1 1  and  13.  Much  mathematical  effort  has  gone  into  trying  to 
show  that  these  are  the  only  lengths  of  Barker  codes  and  much  computing  effort  into  finding 
longer  ones  without  success  in  both  cases. 


Distance  in  raetfes 
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Real  part  of  a  chirp  waveform  Auto-correlation  of  a  chirp  waveform 


Figure  8.  Chirp  waveform  and  auto-  correlation 


X  10'*  Ambiguity  of  a  chirp  waveform 
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Figure  9.  Ambiguity  of  Chirp  waveform 


One  way  in  which  radar  engineers  have  sought  to  overcome  the  lack  of  longer  Barker 
sequences  is  to  use  so  called  “Barker  on  Barker”  sequences.  These  are  formed  by  choosing 
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X  XO* 


Ambigully  of  a  Barker  wavotorm  of  length  13 


Figure  10.  Ambiguity  of  Barker  waveform  of  length  13 


two  Barker  codes  (which  may  be  the  same)  and  substituting  a  multiple  of  one  for  each  entry 
in  the  other  using  that  entry  to  determine  the  mulltiplicative  factor.  Thus  a  Barker  of  length 
3  is  [1  —  1  —  1]  and  using  it  together  with  the  Barker  of  length  5  given  above  we  obtain  the 
following  code  of  length  15: 

[  1  1  1-1-1  1-1-1  1-1-1]. 

Repeated  substitution  of  this  kind  allows  us  to  obtain  arbitrarily  long  codes.  Unfortunately 
their  auto  correlation  and  ambiguity  properties  are  considerably  less  than  ideal.  Figure  1 1 
illustrates  this  for  a  Barker  7  on  a  Barker  13. 

3.1.4.  Costas  Arrays.  Costas  arrays  are  used  in  the  design  of  stepped  frequency  radars;  that 
is,  radars  in  which  the  waveform  comprises  a  sequence  of  pure  tones  at  differing  frequen¬ 
cies.  Thus  a  chosen  collection  of  N  frequencies  f  =  (/i,  /2»  •  •  •  ?  /at)^  are  transmitted  in 
consecutive  time  intervals  of  equal  duration  in  this  order.  The  frequencies  are  chosen  from 
equally  spaced  ones  0  =  (0i,  02j  •  •  •  ?  so  that 

(3.1)  4>j  =  <^>1  +  (j  -  1)(02  -  0i)- 

The  choice  of  order  can  be  described  bya  permutation  matrix  P  so  that  =  P0.  In  order  to 
obtain  good  ambiguity  performance,  the  matrix  P  is  chosen  so  that  for  any  shift  of  the  matrix 
horizontally  or  vertically  the  non-zero  terms  overlap  in  at  most  one  place.  Such  a  matrix  is 
called  a  Costas  array.  This  can  be  re-expressed  in  terms  of  the  set  S: 

(3.2)  S  =  {(i,j)\Pij  =  1}. 
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Velocity  in  metres  per  second 


Figure  1 1 .  Ambiguity  of  a  “Barker  on  Barker”  waveform 


Now  P  is  a  Costas  array  if,  for  r,  s,  r',  s'  distinct  elements  of  5,  no  equation  of  the  form 

(3.3)  s  —  r  =  s'  —  r' 
can  hold. 

There  are  several  ways  to  construct  Costas  arrays  (see  Golomb  [8]).  Here  is  one  due  to 
Golomb.  Let  a  and  0  be  two  distinct  primitive  elements  in  the  Galois  field  GF(p^),  and 
letP  be  the  matrix  whose  entries  are  0  except  at  the  elements  of  the  set 

(3.4)  A  =  :  a*  H-  =  1  (1  <  i  <  q  —  2)}. 

Then  P  is  a  permutation  matrix  and  indeed  a  Costas  array.  Figure  3.1.4  is  an  example  of  a 
Costas  array  of  size  31  and  Figure  3.1.4  is  its  ambiguity. 

3.2.  Complementary  Waveforms 

Following  the  view  that  the  ideal  waveform  is  a  thumb-tack,  we  should  require  of  such 
a  waveform  that  at  least  its  auto-  correlation  be  a  spike  at  the  origin  and  zero  elsewhere. 
As  we  have  seen,  the  chirp  approximates  this  quite  well.  No  single  waveform  can  achieve 
this  perfect  auto  correlation,  though  Barker  waveforms  approach  the  ideal.  It  is  possible 
however  to  find  a  pair  of  waveforms  which  have  the  property  that  if  transmitted  separately, 
each  correlated  against  a  copy  of  itself  and  the  results  added  then  we  do  obtain  a  perfect  spike. 
Such  waveforms  are  called  complementary  pairs.  We  require  of  a  pair  of  waveforms  wi  (t) 
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Figure  12,  Costas  array 


Figure  13.  Costas  array 


and  W2{t)  that 

corr(ti;i,  iui)(t)  +  corr(i02,  ti;2)(t)  =  A^Ct), 

where  At  is  a  short  (triangular)  spike  at  0  of  width  T.  This  is  equally  expressed  in  terms  of 
convolutions: 

(3.5)  Wx  ^Wx{t)-\-W2*  W2(t)  =  Arit), 

where  w(t)  =  w{—t).  As  we  have  said,  it  is  possible  to  find  such  pairs. 

In  fact  it  is  possible  to  find  a  pair  of  discrete  codes  pi  and  p2  which  are  finite  sequences 
of  ±ls  satisfing 

(3.6)  corr(pi,pi)(A:)  +  corr(p2?P2)(fc)  =  0  except  when  fc  =  0, 
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and  from  these  easily  construct  waveforms  satisfying  (3.5).  The  classical  construction  of 
these  is  due  to  Golay  and  independently,  Shapiro.  It  is  an  inductive  construction  starting  with 
the  two  codes 


=  [1. 
=  [1, 


1] 

-!]• 


Then  longer  codes  are  constructed  by  the  formula: 


pf'=[pr^  pr^>] 

pW  =  [pf-),  -p(^-)]. 


Our  intention  is  to  transmit  the  two  waveforms  in  separate  channels  and  on  reception 
keep  them  separate  and  matched  filter  each  against  the  corresponding  transmitted  form.  The 
appropriate  ambiguity  function  for  such  a  pair  of  codes  is  the  sum  of  the  ambiguities  of 
each  code  separately.  By  MoyaTs  identity,  since  pi  and  p2  are  orthogonal,  so  are  xpi 
Xp2-  After  normalisation  so  that  the  sum  of  the  energies  in  pi  and  p2  equals  1,  we  have  the 
following: 

(3.7)  lIXp.  +  XP.IP  =  lIXpJP  +  llXp^ir  =  IIPill"  +  IIP2II"  = 

This  shows  that  the  ambiguities  of  complementary  waveforms  already  have  some  chance  of 
being  more  thumbtack-like  than  single  channel  waveforms,  since  the  corresponding  answer 
for  a  single  channel  waveform  is  1,  and  the  height  at  the  centre  ((0, 0))  is  the  same  in  each 
case.  The  ambiguity  of  a  complementary  pair  is  illustrated  in  Figure  14.  In  fact  at  dopplers 
of  interest  in  typical  applications  (up  to  mach  3,  say)  complementary  pairs  are  quite  doppler 
tolerant.  What  is  remarkable  is  that  at  all  dopplers  the  range  ambiguity  is  zero  outside  half 
the  range  of  the  waveform. 

It  may  seem  that  complementary  sequences  offer  ambiguity  properties  superior  to  single 
waveforms.  However  there  are  serious  issues  involved  in  the  implementation  of  these  wave¬ 
forms.  How  does  one  maintain  a  significant  separation  of  the  two  complementary  sequences? 
If  they  are  separated  in  time,  then  the  target  must  remain  coherent  over  the  time  span  of  the 
transmission  for  the  advantages  to  be  worthwhile.  Separation  in  frequency,  while  feasible, 
also  carries  with  it  disadvantages.  The  responses  of  targets  are  frequency  dependent  as  is 
attentuation  through  the  atmosphere.  Moreover,  for  extended  targets,  the  effects  of  doppler 
change  with  different  frequencies.  These  result  in  degradation  from  the  ideal  situation  de¬ 
scribed  here.  The  complexity  of  the  electronics  is  also  increased.  One  final  observation: 
ultimately  there  is  really  only  one  signal  transmitted,  whether  it  be  spread  over  a  long  time 
period  (as  time  separation  would  require)  or  have  a  high  bandwidth  (in  the  case  of  frequency 
separation).  The  use  of  complementary  waveforms  is  merely  a  device  to  manipulate  the 
ambiguity  function  of  a  single  waveform  to  push  the  lobes  into  more  desired  parts  of  the 
range-doppler  plane. 
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Ambiguity  of  a  Gotay  pair  of  length 


Figure  14.  Ambiguity  of  a  Golay  pair  of  length  64 


It  is  possible  to  go  beyond  just  two  waveforms.  We  recall  the  construction  of  the  PONS 
matrix.  This  works  by  analogy  with  the  Golay  pairs  but  produces  2”  waveforms  of  length 
2”  which  are  complementary  in  pairs,  so  that  one  could  use  any  number  of  complementary 
sequence  pairs  separated  by  frequency  or  time  or  a  mixture  of  both.  The  recursive  method  of 
construction  is  to  take  two  “parent”  codes  Si  and  S2  of  length  /,  say,  and  use  them  to  construct 
four  “children”  of  double  the  length: 


Si  •  S2 

51  •  -S2 

52  •  Si 
— S2  *  Si, 

where  •  indicates  concatenation.  It  is  relatively  straightforward  to  see  that,  if  the  two  parent 
codes  are  orthogonal,  so  are  the  four  children.  Classical  PONS  is  now  formed  by  starting 
with  the  two  codes 

1  1 
1  -1 

and  repeatedly  applying  the  above  construction. 

Of  course  the  issues  raised  in  the  preceding  paragraph  are  correspondingly  exacerbated, 
but  for  N  waveforms  the  power  in  the  ambiguity  is  now  (by  a  repeat  of  calculation  (3.7)) 
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just  1/iV.  As  Figure  15  illustrates,  when  we  use  half  of  the  PONS  matrix  of  length  64  the 
improvement  in  ambiguity  is  remarkable.  We  have  perfect  range  sidelobes  at  all  dopplers. 


X  1  Ambiguity  of  first  32  PONS  sequences  of  length  64 


Figure  15.  Ambiguity  of  the  first  32  rows  of  the  PONS  matrix  of  length  64 
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The  Mathematical  Theory  of  Wavelets 
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St.  Louis,  MO  63130 

Abstract.  We  present  an  overview  of  some  aspects  of  the  mathematical  theory  of 
wavelets.  These  notes  are  addressed  to  an  audience  of  mathematicians  familiar  with 
only  the  most  basic  elements  of  Fourier  Analysis.  The  material  discussed  is  quite 
broad  and  covers  several  topics  involving  wavelets.  Though  most  of  the  larger  and 
more  involved  proofs  are  not  included,  complete  references  to  them  are  provided.  We 
do,  however,  present  complete  proofs  for  results  that  are  new  (in  particular,  this  applies 
to  a  recently  obtained  characterization  of  “all”  wavelets  in  section  4). 

1.  Introduction 

A  wavelet  is  a  function  ?/;  in  (M)  such  that  the  system 
(1.1)  tpjk  (a:)  =  2^’/^  -  k) 

j,  k  eZ,  is  an  orthonormal  basis  for  (R).  Observe  that  if  is  the  translation  operator 
mapping  'll;  into  {Tkip){x)  =  ip  {x  —  k),  keZ,  and  is  the  dilation  operator  defined  by 
{D^  ip){x)  =  2^/^  Ip  {2^x),  then  the  system  {ipjk}  is  obtained  by  first  applying  the  translation 
Tk  to  Ip  and,  secondly,  the  dilation  to  the  function  Tkip.  As  we  shall  see  later  on,  it  is 
important  to  respect  this  order  of  applying  these  operators:  the  translation  operator  is  applied 
before  the  dilation  operator. 

Two  examples  of  wavelets  were  known  for  a  long  time:  the  Haar  wavelet  and  the  Shannon 
wavelet.  The  former  is  the  function 


{1,  if0<a;<l/2 
-1,  if  1/2^  a;  <1 
0,  elsewhere 
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The  latter  is  the  function  ip  whose  Fourier  transform  is 


(1.3) 


1,  if$e(-l, -1/2]  U  [1/2,1) 
0,  elsewhere 


The  Fourier  transform  we  shall  use  is  given  by  the  equality 


/oo 

dx 

■00 

whenever  /eL^{E).  We  assume  that  the  reader  is  acquainted  with  the  basic  L^-  the¬ 
ory  of  the  Fourier  transform.  In  particular,  (Tkf)^{0  =  and  (?)  — 

2”'^/^  /  (2“^  ?).  Thus,  translations  by  k  are  converted  by  the  Fourier  transform  into  modu¬ 
lations  by  —k  (multiplication  by  the  dilations  become  the  dilations  D~^  after 

taking  the  Fourier  transform.  The  Plancherel  theorem,  the  fact  that  {2^5},  j  e  Z,  is  a  partition 
of  R  —  {0}  when  S  =  (-1,  —1/2]  U  [1/2, 1),  and  the  completeness  of  the  system 
/:eZ,  in  jL^(5),  immediately  imply  that  the  Shannon  function  'ip  in  (1.3)  is  a  wavelet.  That 
the  Haar  function  defined  in  (1.2)  is  a  wavelet  has  been  well  known  since  it  was  introduced 
in  1910  [Ha].  In  any  case,  this  is  an  easy  application  of  the  characterizations  of  wavelets  we 
shall  present. 

In  the  early  eighties  many  different  constructions  of  wavelets  were  discovered.  This  in¬ 
cluded  several  other  similar  methods  of  reproducing  functions.  For  example,  pairs  of  systems 
{(pjk}  and  {'ipjk},j,  keZ,  were  introduced  so  that  for  any  fe  L^(R)  we  have  the  reproducing 
formula 


(1.5)  f=^(f,4>jk)'>Pjk 

j,k€Z 

for  all  /eZ/^(R). 

We  will  present  a  careful  accounting  of  who  produced  the  results  we  describe  throughout 
the  text  as  well  as  in  an  appendix  at  the  end  of  this  exposition.  Soon  after  the  “new  wavelets” 
were  introduced  it  became  apparent  that  they  had  important  applications  in  various  different 
areas.  This  attracted  many  investigators  whose  principal  interest  was  in  these  applications. 
Perhaps  this  detracted  attention  from  the  mathematical  theory  that  is  associated  with  wavelets 
and  similar  concepts.  Our  purpose  is  to  present  some  of  this  theory.  It  is  our  belief  that  it  is  a 
beautiful  subject  connected  to  many  areas  of  mathematics. 

It  is  clear  from  the  little  that  has  been  presented  so  far  that  the  Fourier  transform  must 
play  an  important  role  in  the  study  of  bases  and  similar  systems  that  are  constructed  by 
applying  translations,  dilations  and  modulations  to  a  specific  function.  Let  us  illustrate  this  by 


G.  Weiss  and  EM  Wilson  /The  Mathematical  Theory  of  Wavelets 


331 


presenting  a  characterization  of  those  tp  e  L^(E)  such  that  {'ipj,k}yj,  keZ,  is  an  orthonormal 
system: 

Proposition  I.  Suppose  ip  e  L^(E).  Then  {'Ip(.  —  k)  :  keZ}  is  an  orthonormal  system 
if  and  only  if 


(A)  ^  I  +  fc)  I  ^  =  1  fora.e.  $ eE. 

keZ 

The  proof  of  this  fact  is  very  simple.  The  orthonormality  condition  is  (.  —  j),  ^  — 

1) )  =  Sji,  which,  by  the  Plancherel  theorem,  is  equivalent  to 

<5,7=  [ 

Jr 

We  can  then  “periodize”  this  integral  so  that  it  takes  the  form 

fceZ 

and  we  see  that  the  orthonormality  condition  is  equivalent  to  the  statement  that  the  1 -periodic 
function  X)feeZ  1  (?  +  ^)  which  clearly  belongs  to  ([0, 1)),  has  Fourier  coefficients  0 

corresponding  to  all  non-zero  frequencies  and  the  zero-coefficient  is  1.  But  this  is  equality 
(A). 

Proposition  II.  The  systems  {ipj^^k}  cmd  {tpj^^k}ykeZ,  are  orthogonal  to  each  other 
whenever  j  I  ^  j2  if  and  only  if 


(B)  ^  tp  k)  tp  (2J'(^  +  /e))  =  0  for  a.e.  ^  e  E  whenever  j  ^  1. 

keZ 

By  a  change  of  variable  the  orthogonality  condition  can  be  reduced  to  the  case  ji  =  0  and 
^2  ^  1.  A  periodization  argument,  just  like  the  one  we  just  described  then  gives  us  equality 
(B). 

Thus,  we  see  that  the  characterization  of  all  wavelets  is  reduced  to  finding  a  condition 
that  implies  the  completeness  of  the  system  j,  keZ.  It  turns  out  that,  again,  a  simple 
equality,  involving  the  Fourier  transform  of  ip,  provides  us  with  such  a  characterization  of 
completeness: 

Proposition  III.  (The  characterization  of  all  wavelets  in  Jv^(E)).  A  function  ip  e  L^(R) 
is  a  wavelet  if  and  only  if  the  system  {ipjk}  is  orthonormal  and 
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(C)  =  1  fora.eitR. 

jcZ 

Unlike  Proposition  I,  this  result  is  not  immediate.  It  is  also  quite  new.  We  shall  discuss 
its  proof  in  the  sequel.  For  the  moment,  let  us  make  some  observations. 

The  characterization  of  orthonormality  involved  averaging  (summing)  over  the  group  of 
integral  translations.  Since  the  group  of  dyadic  dilations  also  plays  a  basic  role  in  the  defini¬ 
tion  of  a  wavelet  it  is  natural  to  expect  that  averaging  over  this  last  group  plays  a  part  in  this 
characterization.  In  fact,  this  is  precisely  what  is  the  case  in  equality  (C).  What  is  surpris¬ 
ing,  however,  is  that  there  is  a  characterization  of  all  wavelets  that  involves  only  sums  over 
dilations: 

Proposition  IV.  (Another  characterization  of  all  wavelets)  Suppose  'ip  e  then  ip 

is  a  wavelet  if  and  only  (/"ll^lb  ^  1,  equality  {C)  is  satisfied,  and 


(D)  (0  =  51  (?  +  9))  =  0  for  a.e.^t  R, 

whenever  q  is  an  odd  integer. 

Let  us  explain  the  role  played  by  the  hypothesis  |  |V’||2  ^  1.  Let  H  be  a  separable  Hilbert 
space  and  ^  =  {e^  :  a  e  v4}  a  countable  collection  of  vectors  in  H  {A  can  be  N  or  {(j,  k)  : 
j.keZ})  such  thax\\u\\^  =  for  each  ue//.  Such  a  collection  £  is  then  called 

a  tight  frame  (of  constant  1)  for  H.  If  l|ea|  |  >  1  for  uWaeA  then,  letting  u  =  eao »  we  have 

IkaolP  =  E  ea)P  =  lleaoir  +  E  ea)P. 

ottA  Q^ao 

Hence, 

IK„lP(l-||ea„|P)  =  E  IK.ea)P. 

a?^Q0 

Because  of  our  assumption  that  ||eaoll  ^  1,  the  left  side  of  this  equality  cannot  be 
strictly  bigger  than  0,  while  the  right  side  cannot  be  negative.  It  follows  that  UcaJI  =  1 
and  (cao,  ea)  =  0  for  all  ao  and  a  ao-  That  is,  S  is  an  orthonormal  basis  (see  pages  336-7 
of  [HW]  for  a  more  complete  account  of  these  matters). 

The  two  equalities  (C)  and  (D)  characterize  those  ip  (E)  for  which  the  system  {ipjk}^ 
k  in  Z,  is  a  tight  frame  of  constant  1  for  L^(R).  The  condition  \  \ip\\2  ^  1  assures  us  that 
this  system  is  an  orthonormal  basis;  that  is,  that  ip  is  sl  wavelet. 

The  four  equations  (A),  (B),  (C)  and  (D)  not  only  provide  us  with  a  rather  simple  charac¬ 
terization  of  all  wavelets,  but  they  are  most  useful  for  constructing  large  classes  of  wavelets. 
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For  example,  it  follows  immediately  from  (A)  or  (C)  that  if  is  a  wavelet  then  IV’  (01^1 
a.e.  Since  ||^||2  =  HV’lb  =  1  this  means  that  ^  (0  ^  0}  must  have  measure  at  least  1. 
The  Shannon  wavelet  is  an  example  for  which  this  set  has  measure  precisely  1.  It  is  natural 
to  consider  the  class  of  all  wavelets  ^  such  that  W  =  =  {$  :  (0  ^  0}  has  measure 

1.  For  such  Ip  we  clearly  must  have  |^|  =  Xv^^•  It  is  natural  to  call  the  class  of  such  wavelets 
the  collection  of  Minimally  Supported  Frequency  (MSF)  wavelets.  It  is  an  easy  exercise  to 
show  that  the  MSF  wavelets  are  characterized  as  the  class  of  all  t/j  e  L?  (E)  such  that  |^  (^)  | 
assumes  only  the  values  0  or  1  a.e.  and  equations  (A)  and  (C)  are  satisfied.  The  sets  W  = 

on  which  the  Fourier  transform  of  MSF  wavelets  is  not  zero  are  called  wavelet  sets.  (A)  and 
(C)  are  equivalent  to  the  statement: 

Theorem  (1.1).  W  is  a  wavelet  set  if  and  only  if  each  of  the  collections  {W  —  k},k  in 
Z,  and  {2^W},j  in  Z  is  a  partition  ofK 

In  the  course  of  this  exposition  the  reader  will  find  many  examples  of  wavelets  constructed 
by  the  use  of  these  4  equations  and  in  the  appendix  we  will  give  a  still  larger  class  of  wavelets 
obtained  by  these  means. 

We  shall  also  consider  the  subject  of  wavelets  tp  e  L^{W).  Not  only  will  we  show  many 
of  the  various  properties  they  enjoy,  but  we  will  generalize  the  concept  by  showing  how  other 
dilations  and  translations  can  be  used  for  obtaining  orthonormal  bases  or  tight  frames  fi*om  a 
particular  function  (or  a  collection  of  functions);  moreover,  we  will  extend  all  these  matters 
to  higher  dimensions.  In  order  to  do  this  most  efficiently  it  is  useful  to  discuss  “continu¬ 
ous  wavelets”  associated  with  R”.  For  many  considerations  the  theory  of  these  wavelets  is 
simpler. 

2.  Continuous  Wavelets  in  One  and  More  Dimensions 

Let  G  be  the  affine  group  associated  with  R  consisting  of  all 

(a,  6)  eR  x  R,  a  with  the  group  operation 

{c,d)  o  (o,  6)  =  (ac,  6 -!-“). 

This  operation  is  consistent  with  the  action  of  g  =  (a,  6)  eGona:  in  R  given  by  g{x)  = 
a{x  +  b).  Observe  that  g~^  =  (a,  b)~^  =  (a“\  —ab).  For  ip  e  L^(R)  let 

(2.1)  {Tg  xj})(x)  =  -f=  =  -f=  il^{g-\x))  =  1pa,b{^)- 

V|al  “  Vl«l 

Then  g  — y  Tg  is  a  unitary  representation  of  G  acting  on  (R). 

The  mapping  taking  /  e  (R)  into  the  function 
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{W^f){9)=  [  f{x)(Tg^){x)dx  =  (/,  V'o.i.) 

Jr 

on  G  is  the  (continuous)  wavelet  transform  off.  A  goal  in  wavelet  theory  is  to  find  a  condition 
on  ^  that  allows  us  to  reconstruct  /  from  its  wavelet  transform  via  the  reproduction  formula 


(2.2)  /(x)  =  [  {f,i>g)i>g{x)d\(g)  =  f  (W^f){g)Tg4i{x)d\{g), 

Jg  Jg 

where  A  is  (left)  Haar  measure  on  G  (dX{a^  b)  —  One  can  consider  (1.5)  as  a  discrete 
version  of  this  reproduction  formula  when  (p 

This  condition,  the  admissibility  condition  for  xp,  was  discovered  by  Calderdn  in  1964  [C] 
and  can  be  expressed  in  the  form 


(2.3) 


-j 


da 


R-{0} 


for  a.e.  f .  We  will  show  the  equivalence  of  (2.2)  and  (2.3)  in  a  considerably  more  general 
context. 

It  is  clear  that  (2.3)  is  the  “continuous”  analog  of  equality  (C).  In  fact  it  is  much  more 
than  an  analog.  Suppose  that  'ipeL’^  (R)  satisfies  (C),  then 


for  ^  >  0.  Thus,  it  follows  that,  after  a  renormalization,  tp  satisfies  (2.3). 

This  shows,  essentially,  that  each  wavelet  is  also  a  continuous  wavelet.  On  the  other 
hand,  it  is  clear  that  the  converse  is  not  true;  being  a  wavelet  is  more  restrictive  than  being  a 
continuous  wavelet. 

Let  us  also  observe  that,  in  the  continuous  case,  the  order  of  the  operation  of  translation 
(by  -b)  followed  by  dilation  (by  i),  as  performed  in  the  definition  of  ipa,b,  can  be  reversed 
and  we  would,  again,  have  that  the  same  admissibility  condition  (2.3)  is  equivalent  to  the 
reproducing  formula  (2.2).  More  precisely,  if  G  is  endowed  with  the  operation  (a,  6)  •  (c,  d)  — 
(ac,  ad  +  b)  (that  corresponds  to  the  action  x  — >  ax  +  bon  E)  and 

{Sgil>)(x)  =  -^  i)(g~'x)  =  =  ifa,b  (x), 

^/\a\  s/\a\  a 
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we  have 
(2.4) 


ta,b)i>a,b{x) 


dadb 


=  f  (/,  ^a,b)  A,b  (x) 

Jo  “ 


for  all  /  e  L^(R)  {G  is  the  “new”  version  of  the  affine  group  with  this  last  multiplication,  so 
that  its  left  Haar  measure  is  ^). 

Let  us  now  pass  to  the  extensions  of  these  notions  and  results  to  n  dimensions.  The  Full 
Affine  Group  of  Motions  on  R",  consists  of  all  pairs  (a,  6)  eGL(n,  R)  x  R”  (endowed 
with  the  product  topology)  together  with  the  operation 


(a,  /?)  •  (a,  h)  ~  (aa,  6  +  a  ^  jS), 

This  operation  is  associated  with  the  action  x  — >  a{x-\-b)  on  R”.  The  subgroup 

Af={{a,b)6G*  :a  =  fbeMr} 

is  clearly  a  normal  subgroup  of 

We  consider  a  class  of  subgroups,  {G},  of  G^  of  the  form 

G  =  {{a,b)eG*  laeD^beW}, 

where  D  is  a  closed  subgroup  of  GL(n,  R) .  We  can  identify  D  with  the  subgroup  {(a,  6)  e  G  : 
aeD,b  =  0})  of  G.  We  refer  to  D  as  the  dilation  subgroup  and  M  will  be  called  the 
translation  subgroup  ofG. 

If  p  is  left  Haar  measure  for  Z),  then  dA(a,  b)  =  dp{a)db  is  the  element  of  left  Haar 
measure  for  G. 

Let  T  be  the  unitary  representation  of  G  acting  on  (R”)  defined  by 


(2.5)  (T(„,j,)  V’)(x)  =  I  det a\  il>{a  ^x-b)= 

for  (a,  6)  e  G  and  'll;  eLP’  (R”).  Observe  that  (a,  6)“^  =  (a“^,  —ab).  We  then  have 


(2.6)  {T(a,6)^r(0  = 

where  a*  is  the  transpose  of  a. 

The  wavelet  transform  associated  with  'ip  is  now  defined  by 

(WV/)(a,6)=  (/.V’a.b)  =  [  f{y)W{^^=V^-7=^ 

Zr”  y|dettt| 

whenever  /  e  (R" )  and  (a,  6)  e  G.  Our  first  goal  is  to  find  an  admissibility  condition  for 
that  guarantees  the  general  version  of  the  Calderon  reproducing  formula 
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(2.7)  f{^)=  f  {f,'fpa,b)'ipa,b(x)dX{a,b) 

Jg 

for  all  /  e  (R").  The  analog  of  (2.4),  involving  the  operation  {a,  /?)  o  (a,  b)  =  (aa,  ab-\-/3), 
is  valid  in  this  general  case;  hence,  the  same  admissibility  condition  applies  to  both  “versions” 
of  G.  This  condition  is 

Theorem  (2.1).  Equality(2J)  is  valid  for  all  feL‘^{W)  if  and  only  iffora.e.  ^  ^  0 


(2.8)  A^(e)=/’  |^{o*0pd/‘(a)  =  l. 

Jd 

(compare  with  (2.3)). 

The  following  argument  also  provides  a  (weak)  meaning  for  (2.7). 

W.^  obviously  maps  into  L^{G).  We  claim  that,  if  (2.8)  is  satisfied,  then  is 

an  isometry  from  L^(E")  into  L^((j,  A)  : 

\\W^f\\lHa.x)=  f  (  \{},i>a,i,)?dbd^i{a)  = 

JD 

f  [  \  f  /  (0  I  deta|  dbdfi  (a) 

—  [  i  f  \{f  ip{0'*-)y(ab)\‘^  \deta\db]dii{a) 

Jd  Jr^ 

=  f  if  l{M«*-)r(^)N^lc^/^(«) 

Jd  Jr^ 

-  /  /  \f  {0?ma*0?d^d  11(a)  =  f  l/(0pA^(0de 

Jd  Jr’^ 

-\\f\\l  =  \\f\\l 

By  polarization,  therefore,  we  have 

(2.9)  (W^  /,  =  (/> 

for  all  /  and  g€L'^(W).  In  particular,  the  adjoint,  of  is  a  left  inverse  of  : 

=  J.  We  also  have  shown  that  the  reproducing  formula  is  valid  in  the  weak  sense. 
We  refer  the  reader  to  [S]  for  more  general  versions  of  this  reproducing  formula. 

Suppose,  on  the  other  hand,  satisfies  (2.9)  (so  that  (2.7)  is  valid  in  the  weak  sense)  and 
^0  is  a  point  of  differentiability  for  the  integral  of  A^.  Let  |/  (f)p  =  |Br(^o)r^  (0» 
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where  Br  (?o)  is  the  ball  of  radius  r  >  0  centered  at  ^o-  Reversing  the  equality  chain  used  to 
obtain  (2.9)  we  see  that 

~\Q~ (t  \ I  [  =  i 

l^rUojl  JBri^o) 

for  all  r  >  0.  Letting  r  — ^  0  we  obtain  (Co)  =  L  Since  a.e.  CeR”  is  such  a  point  of 
differentiability,  the  admissibility  condition  (2.8)  is  true  and  the  theorem  is  established. 

It  is  natural  to  ask:  for  what  dilation  groups  D  does  there  exist  satisfying 

the  admissibility  condition  (2.8)?  We  shall  call  such  groups  admissible.  When  n  =  1  and  Z) 
is  the  group  of  non-zero  numbers  (or  positive  numbers)  with  multiplication  being  the  group 
operation,  it  is  clear  that  the  admissibility  condition  is  verified  by  any  ij)  such  that  ^  is  a 
bounded  function  supported  in  a  compact  set  in  R  -  {0}  (appropriately  scaled).  Thus,  there 
exists  ^eL2(R")  satisfying  the  admissibility  condition  .  For  the  same  reason,  in  R”,  the 
group  D  =  {al :  a eR  ~  {0}}  is  also  admissible.  The  group  50(2),  acting  on  R^,  however, 
is  not  admissible.  For  if  this  group  were  admissible,  then  there  exists  e  ^^(R^)  such  that 

1=  T"  \iji(pe^^)\Ue. 

Jo  Jo 

Thus, 

poo  poo  p2TT 

Jo  ^  ii 

which  is  clearly  impossible. 

However,  the  1 -parameter  groups 

(2.10)  D=ipi  =  e^‘'  -.teR,  L  = 
and 

(2.11)  £»=|(  Q  ^)  :a:^0,x,2/eR| 
are  admissible. 

The  following  (almost)  characterization  of  the  admissible  groups  D  can  be  applied  to  see 
the  validity  of  our  claim  about  the  last  two  examples. 

Theorem  (2.2).  Let  D  be  a  closed  subgroup  of  DL{n,  R)  and  consider  the  right  action 
^^a*^ofGonWfor^eR^.  Let 

D|  =  {o6D:||a‘e-^|K€} 

be  the  e-stabilizer  of^,  for  e'^0,  and  let  be  the  stabilizer  of^.  If  either 
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i.  \dei  a\  =  A  (a)  for  all  a  e  D,  or 

ii.  {^  €  R”  :  is  noncompact}  has  positive  Lebesgue  measure 

holds,  then  D  is  not  admissible.  If  both  i  and 

iii.  {^eR”  :  is  non-compact  for  all  e  >  0}  has  positive  Lebesgue  measure 

fail,  then  D  is  admissible. 

(A  is  the  modular  function  of  D  :  the  Radon-Nikodym  derivative  d  pi/dpr^  where  pi  is  left 
Haar  measure  and  pr  is  right  Haar  measure  on  D,  normalized  so  that  A  (/)  =  1,  where  I  is 
the  identity  element  of  D.)  The  proof  is  by  no  means  immediate  and  will  appear  in  [LWWW]. 

When  D  is  given  by  (2.1 1)  and  ^  —  (fi,  6)  with  ?i  0  we  have 

^<  =  {(o  1  )  + 

This  is  compact  for  any  positive  e,  so  condition  2  fails.  Furthermore,  for  a  = 

dpi  {a)  =  dxdyjx^  and  dpr  {a)  =  dxdy /  \x\  so  A{a)  =  d pi{a)/d pr  (a)  =  l/ldeta|  and 
so  condition  i  is  also  invalid.  Hence  D  is  admissible.  When  D  is  a  general  1 -parameter 
subgroup  of  GL{n;U)  (i.e.,  D  =  :  teR}  for  some  nx  n  matrix  L),  D  is  unimodular 

and  det  (e*^)  =  so  i  holds  tr(L)  =  0.  In  this  case  D  is  not  admissible.  When 

tr{L)  ^  0,  i  fails  and  it  is  easy  to  check  that  2  also  fails,  so  D  is  admissible. 

A  homogeneous  Galilei  group  is  a  subgroup  D  of  GL{n  +  1,  R)  which  is  of  the  form 
(2.1 1)  with  y  replaced  by  an  n  x  1  column  vector  (i.e.,  a  member  of  R")  and  x  replaced  by  an 
invertible  n  x  n  matrix  satisfying  various  stipulations;  e.g.,  x  is  an  orthogonal  matrix  or  the 
product  of  an  orthogonal  matrix  and  a  non-zero  scalar.  The  subgroup  G  of  the  affine  group  on 
R"+^  whose  dilation  group  is  D  and  whose  translation  group  includes  all  translations 
is  then  an  inhomogeneous  Galilei  group. 

Using  Theorem  (2.2),  it  follows  easily  that  D  is  not  admissible  if  we  allow  x  to  be  or¬ 
thogonal  while  D  is  admissible  if  we  allow  x  to  be  the  product  of  an  orthogonal  matrix  and  a 
non-zero  scalar.  In  the  second  case,  the  family  {Tg'ip  :  geG]  determined  by  any  continuous 
wavelet  'ip  and  the  reproducing  formula  associated  with  this  family  provide  examples  of  what 
are  known  in  physics  as  coherent  states.  The  admissibility  conditions  for  certain  classes  of 
these  groups  have  been  obtained  by  several  authors.  In  general  these  derivations  are  rather 
complicated  (see  [Co]);  theorem  (2.2)  does  provide  a  simpler  and  more  unified  method  for 
solving  these  problems. 

As  an  introduction  to  the  notion  of  “discretizing”  continuous  wavelets  let  us  observe  that 
the  proof  of  Theorem  (2.1)  in  the  special  case  n  =  1  and  D  the  group  {2^  :  j  e  Z}  shows  that 
(C)  in  Proposition  III  is  equivalent  to  the  reproducing  formula 
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(2-12)  /  = 

jeZ 

where  =  2-^/^ ^(2%  —  t)  for  j eZ  and  teR.  That  is,  (2.12)  is  a  “discretization”  of 

.  The  replacement  of  the  integral  over  R 

in  (2.12)  by  the  sum  ^  (/,  )'ipjk  gives  us  the  orthonormal  wavelet  expansion  of  /  if  is 

keZ 

such  a  wavelet.  It  turns  out  that  there  are  other  discretizations  of  (2.12).  For  example,  if  n  is 
an  odd  integer  and  is  an  o.n.  wavelet,  then 

jeZ  keZ 

with  convergence  in  L^(R)  (see  [CS]).  This  is  an  example  of  a  phenomenon  known  as  over- 
sampling.  Letting  n  tend  to  oo  we  obtain  (at  least  formally)  the  equality  (2.12)  which  can 
be  thought  of  as  “the  ultimate  oversampling  property”  of  a  ^  satisfying  (C).  There  are  sev¬ 
eral  examples  of  discretizations  of  the  continuous  wavelet  properties.  In  order  to  appreciate 
the  complexity  of  “discrete”  over  “continuous”  wavelets  we  now  present  generalizations  to 
non-dyadic  wavelets,  wavelet  systems,  and  related  families  in  n-dimensions. 

Let  r  =  be  a  lattice  in  R"  (P  any  invertible  nxn  matrix)  and  Aannxn  dilation 
matrix  (each  eigenvalue  A  of  A  satisfies  |A|  >  1)  for  which  C  P.  For  V^€L2(R”)  let 
^  =  {4^3,1  :  .7  e  7 e  r},  where 

(2.13)  (^)  =  I  det  A  il}{A^x  — 

S  is  the  affine  system  generated  by  the  lattice  P,  and  the  dilation  matrix  A.  -0  is  a  wavelet 
relative  to  P  and  A  if  and  only  if  5  is  an  orthonormal  basis  of  L^(R”).  We  shall  also  use 
the  further  notations:  P*  =  {7'  e  R"  :  (7',  7)  e  Z  for  all  7  e  P}  and  P  =  A%  the  transpose  of 
A.  Then  P  P*  C  P*.  Let  S  be  the  set  difference  P*  \  P  P*.  We  then  have  the  following 
generalizations  of  Proposition  III  and  Proposition  IV. 

Proposition  in'.  'Ip  is  a  wavelet  relative  to  P  and  A  if  and  only  if  its  affine  system  S  is 
an  orthonormal  set  in  L^(R”)  and 

(C)  ^  ?)  p  =  I  det  P|  for  a.e.  f  £  K" . 

jeZ 

Proposition  IV'.  The  affine  system  E  generated  by  xp  is  a  tight  frame  of  constant  1  if 
and  only  if  equality  (C)  is  satisfied  and,  for  each  seS, 


(2.2)  with  the  sum 


replacing  the  integral  [  — 
7:^  Jr  W 
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(DO  ^  ^  +  s))  =  0  for  a.e.  f  e  E” . 

^  w  a  wavelet  if  and  only  if  (CO  and  (DO  fzr-e  satisfied  with  ||^||2  ^  1. 

Now  suppose  A  is  of  the  form  for  some  n  x  n  real  matrix  L;  this  is  a  very  mild  extra 
condition  on  A,  e.g.,  it  is  automatically  satisfied  if  A  has  no  negative  eigenvalues.  Write  A* 
for  and  D*  for  (A*)*  =  Then  D  =  {A*  :  feE}  is  a  one  parameter  subgroup  of 
CL(n,  E)  with  Haar  measure  d  fi  (AO  =  dt  If  ^  is  a  wavelet  relative  to  F  —  Z"  and  A,  then 
equation  (CO  implies 


f  \^((ArO?df,{A^)=^  r  \ip{B^0\^dt 

J  D  J  —oo 


jeZ 

for  f  eE”.  So  i/j  is  a  continuous  wavelet  relative  to  D.  It  is  easy  to  construct  examples 
where  A  belongs  to  larger  subgroups  (non  1 -parameter)  (DO  and  il)  remains  a  continuous 
wavelet  relative  to  (DO-  In  view  of  Proposition  IV',  we  again  conclude  that  it  is  relatively  easy 
for  a  function  ip  e  to  be  a  continuous  wavelet  but  that  far  more  structure  is  required 

for  to  be  a  discrete  wavelet.  We  are  also  led  to  pose  the  question  of  determining,  for  a  given 
admissible  group  D  C  CL(n,  E),  which  discrete  subgroups  of  D  and  which  E"  lattices  give 
rise  to  discrete  systems  analogous  to  E  which  are  either  orthonormal  bases  or  tight  frames  for 

Another  observation  is  in  order.  In  many  situations  it  is  appropriate  to  generate  a  wavelet 
basis  with  more  than  one  funtion  ip.  In  the  fourth  section,  for  example,  we  shall  see  that 
L  =  2*^  —  1  functions  ip^ , . . .  ,ip^  arc  needed  to  obtain  MRA  wavelets  in  n-dimensions.  This 
is  reflected  in  what  follows. 

With  r  and  A  as  above  and  with  L  an  integer  ^  1,  we  can  associate  with  each  family 
^  =  {ip^^ip"^, . . .  ,ip^}  C  L^(E'^)  an  affine  system  =  {ipj^  :  j  eZ,  ^eT,!  ^  ^  L} 

where  ip^j^  is  defined  for  ip^  by  (2.13).  In  addition,  ^  generates  a  quasi-affine  system  = 
{ipy^  :  jeZ,7er,l  ^l^L)  where 


ifj^O 

|detAp>'(AJ(x-7))  ifj<0 


} 


Recall  that  an  arbitrary  collection  {ca  :  ae  A}  in  is  a  Bessel  family  if  there  is  a 

constant  (7  >  0  such  that 
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^C'||/||2  forall  /eL^(R"). 

at  A 

Suppose  ^  . . . ,  and  $  =  {ip^,  •  •  • ,  are  two  families  in  L^(R”)  for  which 

the  affine  systems  and  5$  are  Bessel  families.  Then  $  is  an  affine  dual  of  ^  if  is  dual 
to  Eif!  in  the  sense  that  for  all  f.gcL^  (M" ) , 

(2- 14)  {f’9)  =  J2'^Y,  (f’ 

i=l  jcZ  7er 

$  is  a  quasi-affine  dual  of  ^  if  (2.14)  holds  when  and  are  replaced  by  and 

Theorem  (2.3).  Using  the  above  notation,  suppose  E^  and  are  Bessel  families.  Then 

$  is  an  affine  dual  of'^  if  and  only  if 


L 

(2.15)  EE 

1=1  jeZ 

for  a.e.  f  e  E”  and,  for  each  seS  =  r*\Br*, 

L 

(2.16)  tM)  =  E  E  +  =  0 

1=1 


for  a.e.  ^cE^. 

Moreover,  these  two  equations  also  characterize  the  relation  that  $  is  a  quasUaffine  dual  of 

Note  that  (2.15)  and  (2.16)  reduce  to  (CO  and  (£>0  when  £  =  1  and  =  '^  = 

^  =  {t/iO ..,  -0^}  is  a  wavelet  system  (relative  to  F  and  A)  if  E^  is  an  orthonormal  basis  of 
£2(E”);  in  particular,  (2.15)  and  (2.16)  must  hold  with  =  -0^  for  1  ^  z  ^  £. 

The  generalization  to  wavelet  systems  and  the  reversal  of  the  order  of  dilation  and  trans¬ 
lation  in  passing  from  affine  systems  to  quasi-affine  systems  raise  further  questions,  and  there 
is  reason  to  hope  this  may  be  elucidated  by  the  less  technically  formidable  investigation  of 
continuous  wavelets  for  subgroups  of  GL(n,  E).  This  is  an  area  of  active  research  which  we 
shall  not  comment  upon  further  in  these  notes.  Instead,  we  turn  to  the  techniques  needed  to 
prove  the  new  characterization  of  dyadic  wavelets  announced  in  Proposition  III  in  the  first 
section  and  the  generalized  Proposition  III'. 
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3.  Shift  Invariant  Subspaces  and  a  New  Characterization  of  Wavelets 

Proposition  III  was  stated  as  a  conjecture  by  the  first  author  in  a  seminar.  Two  students,  M. 
Bownik  and  Z.  Rzeszotnik,  proved  it  independently.  We  present  the  n-dimensional  extension 
of  an  argument  in  the  Ph.D.  thesis  of  the  latter  [R];  for  a  different  approach  see  [B2]. 

Suppose  9?  is  a  non-zero  function  in  Let  denote  the  algebraic  span  of  the 

translates  (p{'  —  k)  =  where  keZ^.  That  is,  A^  is  the  linear  space  of  all  finite  linear 
combinations  of  the  translates  r*  (p.  Let 


(3.1) 


/eZ" 


It  is  clear  that  is  a  1 -periodic  function  that  is  integrable  on  T”  —  {f  =  : 

0  ^  <  1,  j  =  1, 2, ...,  n}  (by  “1 -periodic”  we  mean  that  it  is  1 -periodic  in  each  variable). 

Moreover,  if  /  =  ^  akTk  (p  is  the  general  element  of  A^,  then 

finite 


m  = 


Conversely,  if  /  (^)  =  t{^)  with  t  a  trigonometric  polynomial,  then  f  tA^.  Thus,  for 
such  an  /  we  have 


2  _ 
2  — 


=  [  E  /  i«(on<?(ONe  = 


=  / 

Jjr 


This  shows  that  the  mapping  that  assigns  to  /  eA<p  the  unique  trigonometric  polynomial  t 
such  that  f  =  t(p  is  an  isometry  between  A^  and  the  space  of  all  trigonometric  polynomi¬ 
als  endowed  with  the  norm 


n1/2 

Thus,  has  a  unique  extension  to  an  isometry  between  the  closure  of  A^  in  L^(E” ), 
and  the  space  L^(T”,  consisting  of  all  l-periodic  functions  s  satisfying  ||s||L2(T",a;^)  < 
00.  Observe  that,  as  functions  on  T"  (or  E"  and  l-periodic),  two  elements  that  are  equal  on 
7^  0}  represent  the  same  element  of  L^(T",  cj^). 

When  /  c  then  U^f  =  s  with  s  e  L^(T”,  o;^)  and 
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(3.2)  II/II2  =  |1s||l^(t,.,), 

where  f  =  stp.  Conversely,  any  5  eL^(T”,  u)^)  gives  rise  loan  feV<p  via  the  last  equality. 

The  translates  of  (^,  Tk(p,ke  Z"  generate  V^p  in  the  manner  just  described.  In  view  of 
the  notions  we  have  been  discussing,  it  is  natural  to  ask  if  which  is  clearly  shift  invariant, 
contains  a  0  which  also  generates  and  {TkO}jke  Z*^,  is  a  tight  frame  for  this  subspace?  The 
answer  is  “yes”  and  is  easily  obtained:  Let  s{^)  =  for  ?  €  and,  say,  s($)  =  0 

outside  Cltp.  It  is  easily  seen  that  0(0  =  5(0?(0  gives  a  function  0eL^(E")  having  these 
properties:  obviously  s  e  in  fact. 

If  /  e  so  that  f  =  tip  and  9  =  sip,  then  {0jt}  =  {t*  0}  is  a  tight  frame  for  V^p  if  and  only  if 


(3.3)  = 

ibcZ« 

But,  using  PlanchereTs  theorem,  and  “periodizing”  the  integral  over  E”, 

{m=f 

Jt- 

Thus,  {(/,  0A:)}j  k  eZ”,  is  the  sequence  of  Fourier  coefficients  of  the  function  f(05(0w<p(0- 
Thus,  by  this  calculation  and  (3.2), 


(3.4)  = 

•/IT’' 

This  proves  (3.3) 

On  the  other  hand,  by  (3.2)  (with  5  replaced  by  f). 


Jjn 

From  this  equality,  (3.3)  and  (3.4)  we  have 

0=  [  m?<^Am-\sm‘'‘^A0]d( 


(3.5) 
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for  all  Choosing  t  =  xe  where  E  is  either  :  1  >  (0}  or 

:l  <  |s(^)p  (^)}  we  see  that  (3.5)  is  equivalent  to 

for  a.e.  ^  e  We  have  proved 

Lemma  (3.6).  For  each  space  ip  e  we  can  find  6eV^  such  that 

{e{--k)},keZ-, 

is  a  tight  frame  (of  constant  J)for  V^.  All  such  9  are  characterized  by  having  Fourier  trans¬ 
forms  0(^)  =  z/(^)  u)^p  (0”^'^^  ?  (0»  ^here  v  is  a  1 -periodic  unimodular function.  Moreover, 
the  tight  frame  property  of  these  9  is  characterized  by  the  equality 

(3.7)  ^|?(^  +  A:)|2 

keZ^ 

(3.8)  =A’n^(0  a.e.mR". 

The  elements  of  =  V$  are  precisely  those  whose  Fourier  transform  is  of  the  form 

t{09{0,teL\V,di). 


Remark.  Equality  (3.7)  is,  clearly,  a  more  general  version  of  equality  (A)  in  Proposition  I 
that  characterizes  the  orthonormality  of  the  system 

From  this  lemma  we  obtain  the  following  characterization  of  shift  invariant  subspaces: 

Theorem  (3.1).  Suppose  V  is  a  closed  subspace  c>/L^(E”).  V  is  shift  invariant  if  and 
only  if  there  exists  a  sequence  of  functions  {9^},  1  <  j,  belonging  to  V  that  are  mutually 
orthogonal  such  that  each  9^  generates  a  tight  frame  (of  constant  1),  —  A:)},  keZ^  for 

the  space  Vei  and 


(3.9)  V  = 

Remark.  All  but  a  finite  number  of  the  9^  can  be  the  zero  function;  in  this  case  V^j  =  {0}. 
Unless  V  is  the  trivial  space  {0},  let  us  order  the  9^  so  that  the  non-zero  ones  are  listed  at  the 
beginning. 


^The  symbol  denotes  the  orthogonal  direct  sum  of  the  sequence  of  subspaces  that  follows  it. 
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Proof,  It  is  clear  that  if  V  satisfies  (3.9)  then  it  is  shift  invariant.  Thus,  we  only  need  to 
show  that  a  shift  invariant  closed  subspace  V  satisifes  (3.9).  Toward  this  end  we  choose  a 
non-zero  (peV  (if  such  ip  exists)  and  apply  Lemma  (3.6)  to  obtain  stOeV^  satisfying  (3.7). 
We  let  0^=0  and  consider  the  orthogonal  complement  of  Vqx  in  V.  Applying  the  same 
argument  to  n  we  obtain  in  this  orthogonal  complement.  Continuing  in  this  fashion 
we  obtain  (3.9)  (if  we  wish  to  be  completely  rigorous,  we  invoke  the  separability  of  V  and 
Zorn’s  lemma). 

Now  suppose  V  is  shift  invariant  and,  thus,  equals  a  direct  sum  as  in  (3.9).  Fix  j  ^  1  and 
f  cE”.  Let  0^  (^)  be  the  vector  in  /^(Z”)  whose  coordinate  is  0^  (^  k),ktZ'^.  Since 

{0^  (-  —  A;)}fcezn  is  a  tight  frame,  equality  (3.7)  tells  us  that 

(3.10)  l|0^'(OII?2(2n,  =  Y,  (^)  = 

A:cZ« 

(in  order  to  avoid  having  to  repeatedly  add  the  expression  “a.e.”  we  tacitly  assume  that  we 
only  choose  $  in  a  subset  of  E”  whose  complement  has  measure  0  and,  for  all  such  (3.7) 
and  the  other  related  properties  we  invoke  are  valid). 

The  orthogonality  of  the  spaces  Vej ,  and  a  periodization  argument  like  the  one  that  gives 
us  equalities  (A)  and  (B),  yields 

(3.11)  (e^(O.0''(C)W")  =  o 

i  ^  /• 

Let  jC(^)  be  the  closure  in  P  (Z”)  of  the  linear  space  generated  by  the  vectors  0^  (^) ,  1  <  j. 
It  is  an  immediate  consequence  of  (3.10)  and  (3.11)  that  the  sequence  {0^  (0  }  j  ^  ^  i  is  a  tight 
frame  (of  constant  1)  for  £($)  (even  if  £($)  =  {0}),  Let  P{^)  be  the  orthogonal  projection 
of  /^(Z")  onto  £(^)  and  eo  €/^(Z”)  the  vector  all  of  whose  coordinates  are  0  except  for  the 
coordinate  corresponding  to  0  eZ”,  which  has  value  1  :  eo(A;)  =  0  if  A:  ^  0  and  eo(0)  =  1. 
Then,  using  the  tight  frame  property  for  {0^  (f)}  we  have 

1  >  =  E  KP(Oeo,  0^'  (e)>P  = 

^  Keo,  P(0  0^'  (0)p  =  E  ^  K)>l'  = 

This  shows 

Lemma  (3.11).  IfV  is  a  closed  shift  invariant  subspace  of  and  we  represent  V 

as  the  orthogonal  direct  sum  (3.9),  then 
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(3.12)  = 

The  following  result  will  be  an  essential  tool  we  shall  use  in  the  proof  of  the  principal 
result  of  this  section.  An  interesting  feature  is  that  the  dyadic  dilation  operator  D  arises 
naturally  in  this  study  of  the  properties  associated  with  the  translation  operators  Tk. 

Lemma  (3.13).  Suppose  we  have  the  same  hypothesis  as  in  the  previous  lemma.  If 
(7veL^(Ef),  then 


(3.14)  njeziyv  =  {0}. 

Proof.  Suppose  there  exists  a  non-zero  fe  D^V-,  we  might  as  well  assume  ||/||2  =  1. 
Since 


fe  D-^V 


we  must  have  D^feV  for  j  ^  0. 

IfgeV  then,  by  Lemma  (3.6)  and  equality  (3.8), 


5(0  =  0^(0 


(convergence  is  in  L^(E”),  where  each  is  a  1 -periodic  function  in  Pi  Q^j)  that  is 

uniquely  determined  on  Moreover, 


(3.15)  WSWl  —  ll^^lli2(T«nn^j)i 

j=i 

(by  (3,2)  since,  in  this  case,  In  particular,  applying  the  above  equality  for  g{^) 

and  OA5)iog  =  D^f,l^0, 

oo 

2-’“/V‘(2-'0  =  EW(0^(?)- 

where  is  1 -periodic,  supported  in  Clgj  and 

oo  C)0 

(3.16)  1  =  IPVII2  -  X/ 

j=i 


Therefore,  if  /  ^  0  we  have 
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(3.17)  /(O  =  2^!^  Y. 

J  =  1 

Let  Q  be  the  translate  of  r  =  {^  =  (6,6>---.^n)  :  0  <  <  IJ  =  l,2,...,n}by 

k  =  (ki,  ki, . . . ,  k„)  €  Z”  satisfying  ki,k2,...,k„  ^  1.  We  claim  that  /(^)  =  0  for  ^  e  Q.  To 
see  this  we  first  observe  that 


,, ,,,  /a  Er=i  iw(2'?)Prf$ = 2-"'  Er=i  = 

2-"‘2"*/.r„E;"l  K(0P<1?=1- 

This  is  a  consequence  of  the  fact  that  2^Q  is  the  disjoint  union  of  2”^  lattice  point  translates 
of  T^,  the  1-periodicity  of  mj,  and  (3.16).  Hence,  using  (3.17)  and  (3.18), 


[  f  k(2'e)ir(2'oirf?< 

Ja  JQ 

\  1/2  /  oc  \  1/2 

2ni/2  /  (  V  |g>(2<^)|2  ^  k(2'0P<l?  ^ 


^nlf2  I  2”^^ 


US 

J2^Q  .^1 


^J=l 


1/2 


dr] 


'  „  oo  \  1/2 

2-"'/  < 
.  •/2'Q  ,=i  / 


-I- 

But  the  last  expression  tends  to  0  as  /  — )■  oo  since  ay  is  integrable  and  the  points  7/  of  2^Q 
satisfy  2^y/n  <  2^|A:|  <  \r]\.  This  shows  that  /($)  =  0  when  f  =  (^i,  $2?  •  •  • )  Cn)  with  ^  1 
for  j  =  l,2,...,n. 

Since  f  eV  for  I  1  we  can  apply  this  argument  to  f{2~^  to  obtain  the  fact  that  f{() 
vanishes  in  the  entire  first  quadrant.  It  is  also  easy  to  see  that  this  proof,  with  obvious  changes, 
shows  that  /  vanishes  in  the  remaining  quadrants.  Hence,  /  and  /  vanish  a.e,,  contradicting 
the  assumption  f  ^0. 

We  are  now  ready  to  establish  the  main  result  of  this  section: 


Theorem  (3.2).  Suppose  ip  e  Z/^(R”)  and  the  system  {ipjk},3^  Z.ke’L^  is  orthonormal, 
then  this  system  is  an  orthonormal  basis  for  L'^{W)  if  and  only  if  ip  satisfies  equality  (C). 


This  means  that  equalities  (A),  (B)  and  (C)  in  section  1  completely  characterize  all  wave¬ 
lets. 
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Proof.  For  each  jeZ  let 


Wj  —  span{ipjk  :  A;eZ"}, 

where  we  assume  that  the  system  {ipjk},j  eZ,keZ'^  is  orthonormal.  As  we  explained  above 
(see  Proposition  III)  if'ipisa.  wavelet,  then  Tp  satisfies  (C).  Thus,  all  we  need  to  do  is  to  show 
that,  under  our  assumptions,  if  (C)  is  satisfied  then  ip  is  a.  wavelet. 

Clearly,  Wj  J_  Wjt  when  j  f.  Thus, 


and  we  must  show  that  {®j(Z  =  {0}.  Let 


V  =  {®j^oWjy. 

Since  each  Wj,  j  ^  0,  is  shift  invariant,  it  follows  that  V  is  shift  invariant.  We  claim  that 

(3.19)  {®j,z  Wj}^  c  nuzD'V. 

This  follows  from  the  fact  that  DW  =  {^j^iWj}^  3  {^jezWj}-^,  Thus,  it  is  natural  to 
see  if  Lemma  (3.13)  can  be  applied  in  order  to  conclude  that  DW  =  {0}.  We  have,  by 
(3.9), 

(3.20)  V  =  ®f^,V0j, 

where  the  functions  9^  satisfy  the  properties  described  in  Theorem  (3.1).  We  want  to  show 
that  ay  (0  =  is  integrable.  In  fact,  we  shall  apply  (3.9)  to 

(3.21)  L‘^{R^)  =  V  e{ei^oWi} 

which  is  clearly  shift  invariant.  Given  geZ”  and  /  >  0  there  exist  unique  fc,peZ"  with 
P  =  {PuP2y . . .  ,Pn)  satisfying  0  <  Pj  <  2^  -  1,  j  =  1, 2, . . . ,  n,  such  that  q  =  2'A:  +  p.  There 
exist  precisely  2”*  such  p  and  each  of  these  lattice  points  determines  the  orthonormal  system 

(x)}  =  {2"V^  (2'(x  -k)-p)},ke  Z", 

which  is  generated  by  the  lattice  point  translations  of  the  function  (Pp{x)  =  2"^*^^  'ip  (2^x  —  p). 
That  is,  each  space  WiJ  ^  0,  which  is  shift  invariant,  has  the  special  (3.8)  decomposition 

(3.22)  Wi  =  epV^i^, 

Putting  together  (3.20),  (3.21)  and  (3.22),  therefore,  we  have 

Voj}  e  {e/>o{©pn^^}}. 
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Applying  Lemma  (3.1 1)  we  then  must  have 


oo  oo 

j=l  p 

oo 

/=o  p 

But  (0  =  2-"'/2  and.  thus. 

Consequently,  the  sum  (^)|^  consists  of  2^^  equal  terms  and,  thus,  must  be  equal  to 

1^(2“^  We  conclude  that 


oo  _  oo 

1  >  MO + E  E  (^)i'  =  MO + E  1^(2-'  ap- 

/=0  P  l=Q 

Because  of  (C),  therefore,  ay  (?)  <  1  -  Etc  1^(2"'  ?)P  =  ESi  ?)P-  Hence, 

MOd^  ^  £  W2'?)Prf?  =  (e  2""')  ll^lll  =  ^  <  oo. 

This  shows  that  ay  is  integrable  and  Theorem  (3.2)  is  established. 

Before  ending  this  section  let  us  make  some  observations.  We  introduced  the  charac¬ 
terization  of  systems  . . . ,  ip^}  that  generate  tight  frames  in  the  discussion  fol¬ 

lowing  Proposition  IV.  Theorem  (3.2)  was  stated  and  proved  for  the  case  L=l.  As  we 
stated  before,  we  did  this  for  simplicity.  If  we  assume  the  orthonormality  of  the  system 
{ipjk}y  /  =  1, . . . ,  L,  1  ^  e  Z”,  it  is  easy  to  see  that  our  proof  goes  through  when  equality 
(C)  is  satisfied.  It  is  a  curious  fact  that  if  we  assume  that  the  functions  have  norm  at  least  1 
then  (C)  and  (D)  are  equivalent  to  the  property  that  the  system  is  an  orthonormal  basis 
for  We  need  not  assume  that  the  L  functions  I  =  1, . . . ,  L,  are  mutually  orthogonal. 
This  orthogonality  is  a  consequence  of  (C)  and  (D). 

One  can  extend  Theorem  (3.2)  to  tight  frames  that  are  semiorthogonal  (that  is  the  sub¬ 
spaces  Wj  are  mutually  orthogonal  and  the  system  is  a  tight  frame  for  the  subspace  it 
generates). 

The  characterization  of  shift  invariant  subspaces,  at  least  in  the  case  of  finitely  many 
subspaces  Ve,  was  introduced  in  [RS].  The  general  case  was  also  obtained  by  M.  Bownik  [B2] 
using  a  different  approach. 
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4.  Multiresolution  Analyses  in  R" 

A  multiresolution  analysis  (MRA)  is  a  sequence  of  subspaces  {V^},  jeZ,  of  L^(R")  sat¬ 
isfying  the  following  conditions 

(4.1)  VjCVj+iJtZ, 

(4.2)  feVj  if  and  only  if  /(2-)  j  cZ, 

(4.3)  Uj.zVj  =  L2(R"), 

(4.4)  there  exists  e  Vo  such  that  {(^(*  -  A:)},  A:  e  Z" 
is  an  orthonormal  basis  for  V^. 

The  function  (p  in  (4.4)  is  called  a  scaling  function  for  this  MRA. 

It  is  not  hard  to  show  that  (4.1),  (4.2)  and  (4.4)  imply 

(4.5)  n,ezVj  =  {0}. 

The  proof  of  the  one-dimensional  version  of  this  implication  can  be  found  on  page  45  of 
[HW].  The  argument  given  there  is  very  similar  to  the  one  we  presented  in  Lemma  (3.13).  In 
fact,  we  adapted  the  argument  in  [HW]  to  provide  the  proof  of  (3.13). 

The  construction  of  a  wavelet  basis  from  an  MRA  can  be  described  in  the  following  way: 
For  each  jeZ  let  Wj  =  Vj+i  fi  V^-^.  A  consequence  of  the  above  hypotheses  is  that  these 
spaces  are  mutually  orthogonal  with 

(4.6)  Vj  eWj  =  Vj+uj€Z, 
and 

(4.7)  L'^{W‘)  =  ®j,zWj. 

If  we  can  find  a  system  $  =  {V"',  . . . ,  ip^}  C  such  that 

{!/.'(• -*)},(  =  !,  2,...,  L.fceZ" 

is  an  orthonormal  basis  for  Wo,  then  (4.2),  (4.6)  and  (4.7)  imply  that 

is  an  orthonormal  basis  for  That  is,  ^  generates  a  wavelet  basis  for  L^(R"). 


G.  Weiss  and  E.N.  Wilson  /  The  Mathematical  Theory  of  Wavelets 


351 


It  is  convenient  to  express  these  properties  in  terms  of  the  Fourier  transform.  By  doing  so 
we  shall  see  that  the  construction  of  appropriate  systems  ^  raises  some  interesting  problems 
and,  in  particular,  we  will  discover  that  L  must  equal  2”  -  1.  It  is  clear  that 

H  =  {/  ■f^Vo}  =  {/(O  =  m(Ov(0  :  meL^T)} 

(where  m  is  1 -periodic  and  its  Fourier  coefficients  are  the  coefficients  in  the  expansion  /  = 
a*  -  A;),  that  is  provided  by  property  (4.4)). 

The  elements  of  Vq,  by  (4.2),  also  provide  us  with  those  of  V-i  and  W^i.  In  each  case 
they  are  of  the  form  d{2^)  with  OcVq;  moreover,  we  must  have 

(4.8)  0{2i)=m(Om 
and,  since  ip  satisfies  (A), 

•'«"  •'T”  fcezn 

=  f  Hopde 

JT" 

By  polarization,  therefore, 

(4.9)  ^)i2(]Kn)  —  (t,  5)l2(T’^) 

whenever  0{2  ^)  =  ^(2  0  =  5($)^($)  are  representations  of  0,  'ip  eVo  via  equal¬ 

ity  (4.8).  We  shall  examine  some  consequences  of  (4.8)  and  (4.9)  when  6  =  (p  and  ^  e  I^o  is 
such  that  the  system  —  k)},ke  Z”,  is  orthonormal.  It  will  be  useful  to  consider  the  set 
of  vertices  of  : 


{e=  (ei,£2,"-  ,^n)  :  =  0  or  1,1  ^  ^  n}. 

By  (4.8)  we  have  (p{2  $)  =  mo(^)^(0  and  since  (p  satisfies  equality  (A)  (because  the  trans¬ 
lates  by  A;  e  Z”  of  form  an  orthonormal  system)  we  have 

1=  E  iv(2e+fc)p  =  EM^+5^)H^(^+5^)i' 

fceZ”  keZ 

=  E  i^(«+|+oP  =  E 

eei?n  icZ”  eet?n 

We  have  used  the  1 -periodicity  on  ttiq  and  the  fact  that  to  each  A:  e  Z”  there  exists  a  unique 
^eZ”  and  such  that  (1/2) A:  =  I  +  (l/2)e.  This  shows  that  the  2*^ -dimensional  vector 
with  components  mo(?  +  (1/2)6)  has  norm  1  (for  a.e.  $  eR”): 
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(4.10)  1=  5]  |mo(e  +  ie)p. 

Since  we  are  assuming  tpeWo  also  satisfies  (A)  this  shows  that  the  “filter”  m  defined  by  the 
equality  ^(2^)  —  ?n(()^(()  also  satisfies  (4.10).  ^ 

We  are  also  assuming  that  (p  and  -  k)  are  orthogonal  for  all  A:  e  Z”.  Hence, 

jRn  Jjn  J 

That  is,  all  Fourier  coefficients  of  the  1-periodic  function  J2uz^  +  /)  are  0.  Thus 

for  a.e.  $  e 


(4.11) 

Consequently, 


0  =  +  k). 

fceZ" 


0=  ^(2?  +  A;)^(2?  +  fe)  = 

X)  +  \k)m{^  +  ^k)  +  i  fc)P  = 


X^  »«o(?  +  ^ ^ e)  X^  !?{?+ |e  +  0l^  = 

ee#„  (tZ" 

X^  "io(C  +  ^£)m(f+^e). 

ee<9n 

This  shows  that  the  2^^ -dimensional  vectors  {mo(f  +  ^s)}  and  {m(^  -f  ~s)},eeT9n,  are 
orthogonal  to  each  other: 


(4.12)  0=1]  +  \  +  \ 

eci?n 

In  fact,  these  properties  characterize  the  “wavelets”  -0  e  Wo  :  ^  e  Wq  is  such  that  the  system 
{'0(-  -  A)},  A:  e  Z„,  is  orthonormal  if  and  only  if  the  vector  {m(^  -f  |  e)},  e  e  'dn,  has  norm  1, 

^In  general,  /  e  Vb  itf  /{O  =  t(0^(C)  with  t  1-periodic  and  in  L^{T^).  We  shall  call  1 1\\& filter  associated 
with  /.  It  is  unique. 
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as  in  (4.10),  and  satisfies  (4.12)  (just  reverse  the  order  of  the  sequence  in  the  equalities  that 
established  (4.10)  and  (4.12)). 

Let  us  use  the  notation  =  (p  and  =  -0  (for  a  ^  e  Wq  having  the  properties  we  just 
discussed).  Suppose  we  find  =  2, 3, . . . ,  L,  in  with  filters  mi  (that  is,  ^^(2f)  = 
1 -periodic)  such  that 


(4-13)  X)  =  <5iii2 

S€‘dn 

a.e.,  then  the  collection  ^  generates  the  orthonormal  system  I  = 

1, 2, . . . ,  L,  €  Z,  /c  e  Z”.  It  is  clear  that  L  cannot  exceed  2”-l  since  the  (L-f  1)  x  2”  matrix  has 
row  vectors  {m/(^  -h  |  e)},  £  e  0  ^  ^  L,  that  satisfy  (4.13);  that  is,  the  L  + 1  row  vectors 
form  an  orthonormal  system  for  a.e.  ^  (as  the  last  calculation  in  this  section  shows,  we  must 
have  L  =  2”  —  1).  Consequently,  we  are  presented  with  the  following  question:  given  the 
MRA  {Vj}yjelj,  with  the  scaling  function  cp,  how  do  we  find  2”— 1  filters  mi{^)  such  that  the 
2^  X  2”  matrix  M  (^)  having  rows  {mi  (^+ 1  e:} ,  e  e  is  unitary  for  a.e.  It  is  clear  from  our 
discussion  that  once  this  is  achieved,  then  the  collection  ^  . . . ,  generates 

a  wavelet  basis  for  L^(E”).  In  this  connection  let  us  make  the  following  observation.  Since 
AI(0  is  unitary  a.e.,  its  “first  column”; 


is  a  vector  of  norm  1 : 


w»o(?) 

mi(0 

ni2»-i(6 


^  m,(^)mi(0  =  1- 

1=0 


Ef=o'  ?(?)  =  viO-  This  shows  that 


p€  spanlV^Li^jt  :  /  =  0, 1,2, .  ..,2"  -  1,A:€Z"}  C  V-i  ©  W-i. 

Since  this  last  space  equals  Vq  which  is  shift  invariant  and  is  spanned  by  {(^(*  —  k)}^  A;  eZ”, 
it  follows  that  the  span  of  the  functions  1  ^  ^  2”  -  1,  just  considered,  must  be  Ty_i. 
We  can  conclude,  therefore,  that  ^  generates  a  wavelet  basis. 

It  is  natural  to  consider  the  problem  of  finding  the  filters  1  <  /  ^  2”-l,  given  the  filter 

mo  associated  with  the  scaling  function  mo.  We  begin  by  looking  for  a  filter  mi(^)  such  that 
the  vector  {mi($  +  |e)},£ei?„,  has  norm  one  and  is  orthogonal  to  {mo(?  +  |£)},£ei9„, 
for  a.e.  This  can  be  done  in  several  ways.  Here  is  a  simple  construction:  Let  1  — 
(1, 1, . . . ,  l)eR”  and£o  =  (1, 0, . . . ,  0, 0)eR’^.  If  we  define  mi(f)  =  e*^'imo(?  +  ^),  then 
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the  two  vectors  Mi{Cl  =  +  l^)},  e  /  =  0, 1  are  orthogonal  to  each  other  and  of 

unit  length  a.e..  This  provides  us  with  one  of  the  desired  wavelet  basis  generators  if  we 
define  it  by  the  equality  (2^)  =  mi  The  search  for  is  more  involved. 

Perhaps  a  few  observations  in  the  case  n  —  2  provide  us  with  an  insight  into  this  situation.  If 
mo  is  real-valued  let  >fo(0  =  (^^0(6 >  6)5  ^o(?i5  6  +  ^o{6  + $2)^  + 1?  6  + 1))» 

and  define  the  row  vectors 

/  =  1,2,3,  where 


/-OOl  0\  /O-IO  0\  /OO 

000  -1  1  00  0  ._  00 
”  -1  0  0  0  ’  “  0  0  0  -1’^^  01 
\0100/  \0010/  \10 

These  provide  us  with  the  unitary  matrix 

’ -A^oK)  1  r  mo(0---  mo(^  +  |(l,l)) 

TTlf\-  -  ”^1  (?)•••  1(1-1)) 

A^2(0  “  m2(e  +  |(l,l)) 

.X3(0j  L"»3(0---  m3(e  +  i(l,l))  J 

such  that  the  first  column  gives  us  the  filters  determining  the  collection  V'2,t/’3} 

that  generates  a  wavelet  basis  associated  with  the  MRA  we  are  considering:  V’'(2^)  = 
mi{^)!p(^),  1  =  0, 1, 2, 3.  If  the  components  of  Ado(5)  Me  complex-valued  let 

(so(?)™o(0.  Si(0™o(f  +  ^(0, 1)), 

S2(Ot”o(?  +  ^(1.0)), 

=  (|mo($)l,  |mo(^  +  i(0, 1))|,  |mo(^  +  i(l,  0))|,  lm„(C  +  ^(1, 1))|) 


so(0  0  0  0 

0  si(0  0  0 

0  0  S2(0  0 

0  0  0  S3(?) 

where  each  S((^)  is  unimodular,  1  =  0, 1, 2, 3.  5(0  is,  then,  a  unitary  matrix  and,  letting 


(4.14) 


A4i(0=^o(05(0ii5*(0, 
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1  =  0, 1, 2, 3,  we,  again,  obtain  the  desired  filters  by  selecting  the  first  components  of  these 

vectors.  The  matrices  /,  Li,  L2,  can  be  considered  to  represent  the  generators  1,  i,  j,  k  of 
the  quaternions  (observe  that  Lf  =  /  =  1, 2, 3,  L1L2  —  L^,  L2L3  =  1  and  L3L1  =  L2). 

We  can  try  to  extend  the  idea  of  this  construction  by  using  the  Cayley  numbers  and,  for  higher 
dimensions,  the  Clifford  algebras.  One  encounters,  however,  some  difficulties  by  following 
this  path.  Even  in  the  two-dimensional  case,  the  vectors  defined  by  (4.14)  may  lack  desired 
smoothness  due  to  the  discontinuities  of  the  signum  functions.  Thus,  even  if  the  scaling 
function  is  compactly  supported  and  has  other  desirable  properties,  we  cannot  expect  the 
wavelets  so  obtained  to  be  compactly  supported. 

One  can  obtain  compactly  supported  complex- valued  wavelets  from  an  MRA  in  E^,  how¬ 
ever,  by  taking  tensor  products  of  1 -dimensional  MRA’s.  The  general  case  can  be  easily 
understood  once  we  present  the  following  two-dimensional  case.  Let  0  be  a  scaling  func¬ 
tion  in  L^{R)  that  is  compactly  supported  with  an  accompanying  low  pass  filter  s  such 
that  the  wavelet  (  satisfying  C(20  =  0(^)  is  compactly  supported  (see  chapter 

2  of  [HW]  where  the  Daubechies  wavelets  are  constructed;  s(f)  in  this  case  is  a  trigono¬ 
metric  polynomial).  Then  (p{x^y)  —  0{x)9(y)  is  a  scaling  function  for  an  MRA  in  L^(E") 
and  the  polynomials  mo(^)  =  mo($i,  ^2)  =  g(6)^(6)> 

m2(0  -  s(fi  -}-  |)5(^2),  ^(^j  _j.  i)  +  -)  will  then  provide 

us  a  unitary  matrix  ^7(^),  as  in  equality  (4.14),  from  which  we  obtain  the  system  of  com¬ 
pactly  supported  wavelets  ^(a;,?/)  =  {'ip^(x,y),'il)^{x,y),'ip^(x,y)}  satisfying  “0^2^)  = 
mi(^)(p{^),  /  =  1, 2, 3.  It  is  clear  that  this  construction  extends  to  n-dimensions  and  it  gives 
us  a  compactly  supported  scaling  function  (p{x)  =  ^(xi, . . .  ,a:„)  that  will  produce  a  sys¬ 
tem  ^(a;)  =  {'ip^ix), . . .  ,0^”~^(a:)}  of  compactly  supported  wavelets;  furthermore,  the  1- 
dimensional  scaling  functions  whose  product  is  cp  can  be  different  scaling  functions  of  the 
variables  a;i,X2, . . . ,  a;„. 

Wavelets  such  as  the  ones  we  just  described,  as  well  as  more  general  tensor  products 
obtained  by  partitioning  the  variables  a;i,a;2, . . .  into  m  subsets,  are  sometimes  called 
separable.  Construction  of  wavelets  having  smooth  filters  is  challenging  even  in  the  separable 
case,  if  we  require  that  they  not  be  tensor  products  of  one  dimensional  functions  (see  [A]  for 
an  elegant,  but  not  simple,  such  construction). 

There  exist  wavelet  bases  for  L^(E”)  that  are  generated  by  single  functions.  They  can  be 
constructed  directly  by  using  the  basic  equations  (A),  (B),  (C),  and  (D)  (see  [SoW])  or  their 
existence  can  be  established  by  employing  operator  theoretic  methods  (see  [DL]).  Clearly, 
these  cannot  be  MRA  wavelets.  The  wavelets  obtained  in  [SoW]  are  MSF  wavelets;  that  is, 
the  absolute  value  of  their  Fourier  transforms  are  characteristic  functions  of  a  set  C  E”,  a 
wavelet  set.  Such  sets  are  “fractal”  and  enjoy  various  interesting  properties  that  are  described 
in  [SoW]. 

It  is  natural  to  ask  if  there  is  a  characterization  of  MRA  wavelets.  The  answer  is  that 
there  exists  a  simply  stated  condition  that  determines  whether  a  function  (or  a  collection  of 
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functions)  is  a  wavelet  obtained  from  an  MR  A.  Let  us  first  consider  the  one  dimensional  case 
and  suppose  is  an  MRA  wavelet;  that  is,  ^(2^)  = 
mo,  mi  are  1 -periodic  functions  satisfying,  in  particular, 

K(OP  +  K(OP  =  i, 

for  a.e.  f  (this  follows  from  the  1-dimensional  versions  of  the  fact  that  >1(0  is  unitary). 
Consequently, 

|^(20P  +  |v(20l^  =  +  |mo(OP}i?(OP  =  1  •  l^(0P 

a.e.  Iterating  this  argument  we  obtain 


Iv(0P  =  I?(2"'0P  +  E 

j-i 

a.e.  for  each  positive  integer  N.  It  is  clear  that  the  limits 

lim  ^  |^(2^'0I^  and  lim  |?(2'^0P 
j=l 

exist  (observe  that  these  sums  are  bounded  and  increasing)  and  the  integrability  of  to¬ 
gether  with  Patou’s  lemma,  imply  that  the  last  limit  is  0  a.e..  Thus, 

oo 

(4.15)  =  W2'0l"a.e. 

j=l 

Since  {(/?(•  -  k)  :  k  cZ}  is  an  orthonormal  system,  we  have,  by  Proposition  I, 


(4.16)  D40  =  EE  W2^K+fc))r  =  i 

j=l  keZ 

a.e..  The  1-periodic  function  (clearly  integrable  on  [0, 1)  whenever  tp  e  L‘^{R))  is  known 
as  the  dimension  function.  Equality  (4.16)  characterizes  the  1-dimensional  MRA  wavelets: 

Theorem.  Suppose  ipeV-  (E)  is  a  wavelet.  Then  ip  is  an  MRA  wavelet  if  and  only  if  D^{^)  = 
1  a.e.. 

See  chapter  seven  of  [HW]  for  a  discussion  and  appropriate  credits  for  this  result. 

The  dimension  function  can  be  defined  by  equality  (4.16)  for  any  ipeL'^iW)  (the 
only  change  is  that  k  now  ranges  throughout  Z").  Essentially  the  same  argument  we  have 
just  given  shows  that  if 
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is  an  MRA  wavelet  system,  then 


L  oo 

(4.17)  =  E  E  E  +  fc))P  =  1 

1=1  j=l  fc€Z” 

a.e..  We  also  observe  that  this  equality  can  only  hold  if  L  =  2”  -  1.  Indeed,  if  (4.17)  is  true, 
then 


1=/  i)«(Orfe=EEE/ 

1=1  i=l  teZ" 

L  oo  «  L  00  /  1  \ 

=  EE  /  W2'e)i^de=EE2’'"iiV''iii=i(^) 

(=1  j=i  i=i  j=i  V 


5.  The  connectivity  of  wavelets. 

Let  us  return  to  the  “classical”  1 -dimensional  wavelets.  We  shall  discuss  the  connectivity 
of  this  class.  We  begin  by  showing  that  the  set  of  MRA  wavelets  is  an  arcwise  connected  set: 

Theorem  (5.1).  and  'il^i  are  two  MRA  wavelets,  then  there  exists  a  continuous  map 
A  :  [0, 1]  — y  L^(M)  such  that  A(0)  =  ipo,  A(l)  =  -01  and  A{t)  is  an  MRA  wavelet  for  all 
f6[0,l]. 

This  result  is  due  to  Xingde  Dai  and  Rufeng  Liang .  Their  proof  is  presented  in  [Wu]  (they 
are  members  of  the  Wutam  Consortium).  We  shall  present  the  basic  ideas  of  their  argument. 
In  order  to  do  so  we  will  use  some  of  the  notions  introduced  in  [Wu].  We  begin  by  introducing 
three  “multipliers”  that  play  important  roles  in  the  theory  of  wavelets: 

Definition.  A  measurable  function  i/  on  R  is  a  wavelet  multiplier  if  and  only  if  (i/  0)^  is 
an  o.n.  wavelet  whenever  ip  is  an  o.n.  wavelet,  z/  is  a  scaling  function  multiplier  if  and  only 
if  {u  ^)^  is  a  scaling  function  whenever  is  a  scaling  function.  A  measurable  function  //  is  a 
low  pass  filter  multiplier  if  and  only  if  /i  m  is  a  low  pass  filter  whenever  m  is  a  low  pass  filter. 

If  (p  ipy  is  an  MRA  wavelet  whenever  0  is  an  MRA  wavelet  we  say  that  z/  is  an  MRA 
wavelet  multiplier. 

These  multipliers  have  been  completely  characterized  in  [Wu]: 

Theorem  (5.2).  P  is  a  wavelet,  MRA  wavelet  or  scaling  function  multiplier  if  and  only 
if  it  is  unimodular  and  p[2f)lp{^)  is  a.e.  equal  to  a  1 -periodic  function,  p  is  a  low  pass  filter 
multiplier  if  and  only  if  it  is  a  unimodular  function  that  is  equal  a.e.  to  a  1 -periodic  function. 

(The  term  “unimodular  function”  means  that  the  function  in  question  has  absolute  value  1 
a.e.). 
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A  few  observations  are  in  order.  If  ipo  is  an  MRA  wavelet  then  the  other  wavelets  be¬ 
longing  to  the  same  MRA  are  precisely  those  functions  ip  such  that  ip  =  uipo  where  u  is 
unimodular  and  a.e.  equal  to  a  1-periodic  function.  If  z/  is  a  wavelet  multiplier,  ip  —  {I'lpoy 
may  very  well  belong  to  a  different  MRA  (by  the  last  theorem  in  section  4  and  Theorem  (5.2), 
however,  we  do  know  that  such  a  is  an  MRA  wavelet).  We  shall  be  interested  in  the  classes 

=  {all  wavelets  ip  :  |^(0I  =  |^?o(0l  a-®-} 

whenever  ipo  is  a  wavelet.  It  follows  from  the  above  observations  that  if  ipo  is  an  MRA  wavelet 
then  each  ip  e  is  an  MRA  wavelet  and  contains  all  the  wavelets  generated  by  the 
MRA  that  produced  ipo.  Furthermore,  if  ip  =  (i>  ipo^,  where  z/  is  a  wavelet  multiplier,  then 
Ip  t  Thus,  if,  for  a  wavelet  ^o»  we  let 

=  {ip  =  {uipoy  :  1/  a  wavelet  multiplier}, 

then 

(5.3)  C  Wv,o 

We  shall  consider  a  third  class  of  wavelets  generated  by  an  MRA  wavelet  ipQ.  In  order  to 
introduce  this  class  it  is  helpful  to  use  the  following  characterization  of  the  scaling  functions 
(see  chapter  7  of  [HW]): 

Theorem  (5.4).  A  function  ip  e  L^(R)  is  a  scaling  function  for  an  MRA  if  and  only  if 

keZ 

2.  lim  1^(2"'^  I  =  1  a.e.; 

j-^oo 

3.  there  exists  a  1-periodic  m  e  L^([— 1/2, 1/2))  such  that 

^(20  =  m(0?(0- 

An  immediate  consequence  of  this  theorem  is  that  |^|  is  the  Fourier  transform  of  a  scaling 
function  whenever  is  a  scaling  function.  The  function  m  is,  of  course,  the  low  pass  filter 
associated  with  the  scaling  function  cp  via  the  equality  in  3  of  Theorem  (5.4);  furthemore,  m 
is  uniquely  determined  by  ip  and  satisfies  the  Smith-Bamwell  equality 

(5.5)  |m(?)P  +  |m(e  +  i)|^  =  l 

for  a.e.  The  most  general  wavelet  belonging  to  the  MRA  generated  by  (p  satisfies 

^(20  =  +  i)v(0 


(5.6) 


G.  Weiss  and  E.N.  Wilson  /  The  Mathematical  Theory  of  Wavelets 


359 


where  s  is  a  unimodular  1 -periodic  function.  An  immediate  consequence  of  (5.5)  and  (5.6) 
is  that  if  (p^  and  are  two  scaling  functions  such  that  |^^(0I  = 

MRA  wavelets  associated  with  (p^  and  then  1  —  1  It  follows  that  the  class 

SjPq  of  all  MRA  wavelets  associated  with  a  scaling  function  p  satisfying  |^(0I  = 
a.e.  (^0  being  an  MRA  wavelet  associated  with  po)  is  a  subset  of  W^q.  On  the  other  hand, 
equality  (4.15)  shows  that  C  Thus, 

(5.7)  Wv-o  = 

whenever  ^Ijq  is  an  MRA  wavelet.  In  fact  we  have 

Theorem  (5.8).  If  ip  is  an  MRA  wavelet,  then 

A4ip  =  =  Sjp. 

This  result  is  proved  in  [Wu]. 

The  sets  and  can  be  defined  for  any  wavelet  'ip,  not  necessarily  MRA.  In  view  of 
Theorem  (5.8),  it  is  natural  to  ask  if  the  equality  is  true  for  all  wavelets  V’.  The 

answer  is  “No;”  Q.  Gu  constructed  a  clever  counterexample. 

Let  us  describe  the  two  basic  ideas  of  the  proof  of  this  theorem.  First  one  shows  that 
each  of  the  classes  ^  MRA  wavelet,  is  connected.  Second,  one  shows  that  each 

MRA  wavelet  ipi  can  be  connected  to  the  Shannon  wavelet  'ipo-  As  we  shall  see,  this  part  of 
the  argument  is  facilitated  by  the  fact  that  we  can  choose  'tpi,  by  Theorem  (5.8),  so  that  its 
associated  scaling  function  pi  satisfies  ^i(^)  ^  0  a.e.  (since  this  clearly  can  be 

done). 

For  the  first  part  of  this  proof,  which  establishes  that  is  connected,  one  chooses 
(by  Theorem  (5.8))  so  that  ipi  =  i/?/;  for  an  appropriate  wavelet  multi¬ 
plier.  Since  i/  is  unimodular  (Theorem  (5.2))  we  can  write  z/(^)  =  A(f),  however,  is 

not  unique,  but  it  is  easy  to  construct  an  appropriate  A  so  that  ^'t(^)  =  is  a  wave¬ 

let  multiplier  (i/f(2^)/Vf(^)  is  1 -periodic)  for  fe[0, 1].  We  then  obtain  a  continuous  map 
9  :  t  — >  ipt  =  (I't  i’Y  on  [0, 1]  such  that  for  f  =  0  we  have  0(0)  =  -00  =  -0  and,  for 
f  =  1,  we  have  0(1)  =  ipi. 

The  second  part  of  the  proof  is  somewhat  more  involved.  We  present  the  basic  features  of 
the  argument.  As  explained  above,  we  can  assume  that  we  have  a  wavelet  ipi  constructed  from 
a  scaling  function  pi  such  that  pi  (^)  >  0  a.e..  It  suffices  to  connect  'tpi  to  the  Shannon  wavelet 
00  whose  scaling  function  po  satisfies  ^o($)  =  ^  follows  that  the  corresponding 

filters,  mo  and  mi,  are  non-negative.  It  is  tempting  to  define 


(5.9) 


TUtiO  =  + 
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for  t  €  [0, 1]  (observe  that  the  Smith-Bamwell  equality,  +  mt{(  -f  i)2  1  is  true).  This 

provides  us  with  a  continuous  path  of  low  pass  filters.  The  corresponding  scaling  functions 
satisfying 


form  a  continuous  path  of  scaling  functions.  The  desired  path  of  wavelets  is,  then,  obtained 
by  letting 


M'iO  =  +  n)lpt{0 

for  t  e  [0, 1] .  This  scheme  “almost  works”.  The  main  modification  needed  is  that  the  inter¬ 
mediate  filters  be  obtained  by  an  equality  that  is  technically  more  complicated  than  (5.9) 
(see  [Wu]  page  588). 

The  consideration  of  the  connectivity  of  wavelets  is  very  natural.  The  “first  wavelets” 
(besides  the  Haar  and  Shannon  wavelets),  constructed  in  the  early  eighties  by  Lemari^  and 
Meyer,  are,  in  a  real  sense,  obtained  by  a  continuous  “smoothing”  (on  the  Fourier  trans¬ 
form  side)  of  the  Shannon  wavelet.  A  general  result  on  connectivity  was  obtained  in  [BDW]. 
The  authors  in  this  work  concerned  themselves  with  wavelets  produced  by  very  smooth  fil¬ 
ters.  The  paths  obtained  were  continuous  with  respect  to  a  topology  that  is  considerably 
stronger  than  that  produced  by  the  L^(M)-norm.  There  is  a  topological  impediment  that 
prevents  one  from  connecting  two  wavelets  in  general  when  this  stronger  topology  is  used. 
For  example,  if  is  the  Haar  wavelet  and  'ipi(x)  =  'ipoix  —  1)  is  its  translate  by  1,  then 

The  fact  that  the  function  that  is  identically  1  and  s(f)  are 
not  in  the  same  homotopy  class  prevents  the  existence  of  a  path  joining  these  two  wavelets 
that  is  continuous  in  the  topology  used  in  [BDW].  These  questions,  as  well  as  extensions  of 
the  results  in  this  last  work,  are  discussed  by  G.  Garrigds  in  [Ga]. 

An  interesting  connectivity  result  was  obtained  by  D.  Speegle  [SP]  who  showed  that  the 
collection  of  MSF  wavelets  is  connected.  Let  us  say  a  few  words  to  put  this  result  in  some 
perspective.  One  of  the  first  examples  of  a  wavelet  that  is  not  an  MRA  wavelet  was  obtained 
by  J-L  Joum^e.  His  wavelet  is  an  MSF  wavelet  (see  [HW],  page  64).  It  follows  that  the 
union  of  the  MRA  and  MSF  wavelets  is  an  arc  wise  connected  subset  of  the  surface  of  the 
unit  sphere  in  L^{R).  Auscher  [A]  and  Lemari6  [Le]  have,  independently,  shown  that  if  one 
makes  rather  mild  assumptions  about  the  Fourier  transform  of  a  wavelet  (continuity  and  a 
decrease  at  oo),  the  wavelet  arises  from  an  MRA.  We  see  therefore,  that  “most”  wavelets 
are  either  MRA  or  MSF  wavelets.  It  is  not  unreasonable  to  conjecture,  therefore,  that  the 
collection  of  all  wavelets  in  L‘^(R)  is  connected.  The  answer  to  this  conjecture,  however,  is 
not  yet  known. 

If  we  consider  the  question  of  connectivity  and,  more  generally,  multipliers,  in  connection 
with  higher  dimensional  wavelets  produced  by  more  general  dilations  we  find  various  factors 
that  alter  the  form  of  the  results  we  seek.  We  have  seen,  for  example,  that  the  MRA  wavelets 
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in  higher  dimensions  require  more  than  one  generator.  This,  however,  depends  on  the  dilation 
that  is  used.  This  is  not  the  case  for  those  wavelets  on  obtained  by  the  translations  by  the 
points  of  the  lattice  c  R^  and  the  dilations  obtained  from  the  integral  powers  of  the  matrix 


That  is,  if  ^  e  L^(R^ ),  the  system  {'ipjk}J  e  Z,  fc  e  Z^  is  defined  by 

(5.10)  i^jkix)  =  (det  -  k)  =  2^^^  -  k) 

for  a;  e  R^  (M^x  denoting  the  vector  obtained  by  multplying  the  matrix  by  the  column 
vector 


A  function  such  that  {ipjk}^jeZ^ke’L‘^,  is  an  orthonormal  basis  for  L^(R^)  is  called  a 
Quincunx  wavelet.  There  exist  MRA  wavelets,  ip,  of  this  type  producing  orthonormal  bases 
of  the  form  {ipjk}y  just  as  in  the  one-dimensional  case.  That  is,  even  though  the  dimension  of 
the  underlying  space,  R^,  is  2,  we  do  not  need  3  (—  2^  -  1)  generators  to  obtain  a  “wavelet 
basis”  for  L^(R^)  as  is  the  case  described  in  section  4. 

The  quincunx  wavelets  share  many  other  properties  with  the  “classical”  wavelets  on  R 
obtained  by  integral  translations  and  dyadic  dilations.  In  particular,  the  Wutam  program,  for 
the  quincunx  wavelets,  is  just  like  the  one  we  just  presented. 

Let  tp  e  L^(R^)  and  {'ipjk}  be  the  system  defined  by  equality  (5.10).  Then,  letting  Mi  be 
the  transpose  of  M,  we  have 

(5.11)  ipjkiO  =  2“^^"  e 

for  $  e  R^ ,  j  €  R  and  k  e  1?.  Until  further  notice,  we  shall  use  the  term  “wavelet”  for  an  or¬ 
thonormal  quincunx  wavelet  (that  is,  a  function  ip  e  L^(R^)  such  that  the  system  defined  by 
(5.10)  is  an  o.n.  basis  for  L^(R^)).  The  characterization  of  wavelets  obtained  by  general 
dilations.  Theorem  (2.3),  reduced  to  the  situation  we  are  now  studying,  becomes 

Proposition  (5.12)..  Let  ipeL^(^)  such  that  \  \ip\\2  ^  L  Then  ip  is  an  o.n.  wavelet  if 
and  only  if 

(I)  |^;(M^e)p  =  l/ora.e.^£R^ 

jeZ 


(H)  Y.  +  «))  =  0/ora.e.  f eK= 
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whenever  q  = 


e  1?  with  Qi ,  q2  having  different  parity. 


Let  T  =  T2  =  {e  -  (6,6) :  ^  6  <^3  =  2}  and 


S  =  (MT)\T 

(MT  is  the  square  with  vertices  (0,1),  (-1,0),  (0,-1)  and  (1,0)).  Letting  ip  be  defined  by 
having  ip{^)  =  *%'s(0,  il  is  easy  to  check  that  ip  is  an  orthonormal  wavelet  that  is  also  an 
MRA  wavelet.  A  scaling  function  'tp  for  the  MRA  is  obtained  by  letting  ^(6  =  ^^(O  and 
its  associated  low  pass  filter  m  is  the  characteristic  function  of  T,  restricted  to  T,  and 
then  extended  to  be  1 -periodic  in  each  variable.  In  fact,  this  provides  us  with  an  example  of 
an  MRA,  MSF  wavelet  in  the  quincunx  case  (the  analogue  of  the  Shannon  wavelet). 

The  notion  of  multiplier  (wavelet,  scaling  function  or  filter  multiplier)  as  well  as  the 
definition  of  the  sets  W^p,  and  extends  in  an  obvious  way,  and  so  does  Theorem  (5.8). 
In  fact,  it  is  not  hard  to  obtain  the  version  of  Theorem  (5.1)  that  asserts  that  the  MRA  quincunx 
wavelets  are  path-connected.  The  details  of  this  “Wutam  program”  adapted  to  this  situation 
were  presented  in  a  Ph.D.  qualifying  oral  exam  by  one  of  our  students,  L.  Zhang. 


6.  Summary  and  Bibliographical  comments. 

As  we  stated  at  the  beginning,  one  of  our  motivations  was  to  present  the  ‘‘Mathematical 
Theory  of  Wavelets.”  Obviously,  we  cannot  do  this  in  any  exhaustive  way  in  this  rather  short 
article.  Our  aim  was  to  describe  some  of  the  beauty  of  this  theory  and,  if  possible,  elaborate 
topics  that  are  new.  We  hope  that  this  write-up  does  have  some  of  these  properties. 

We  have  not  considered  many  important  uses  of  wavelet  theory  in  mathematics.  For  ex¬ 
ample,  their  application  to  the  construction  and  deriving  properties  of  a  large  collection  of 
important  function  spaces  (the  Besov  and  Triebel-Lizorkim  spaces)  is  not  a  topic  we  dis¬ 
cussed.  Chapters  5  and  6  in  [HW]  and  [FJW]  deal  with  this  topic. 

We  described  various  kinds  of  wavelets,  scaling  functions,  filters  and  we  gave  character¬ 
izations  of  almost  all  these  function  with  the  exception  of  low  pass  filters.  One  of  us  was 
involved  in  the  characterizations  of  all  low  pass  filters  in  1 -dimension  and  for  dyadic  dila¬ 
tions  [PSW].  The  characterization  of  the  dual  systems  (^,  ^)  announced  by  Theorem  (2.3) 
really  pertains  to  the  case  where  the  dilation  matrix  A  has  integer  entries  and  the  translations 
are  obtained  from  the  lattice  Z”  (in  terms  of  the  notation  used  in  section  2,  the  matrix  P 
must  have  integer  entries  and  this  similarity  reduces  the  problem  to  the  lattice  Z").  There  are, 
however,  wavelet  systems  that  involve  more  general  dilations  and  translations;  in  [CCMW] 
the  problem  of  the  characterization  of  such  systems  is  solved. 

Though  we  concentrated  our  attention  to  wavelets,  we  indicated  that  other  systems,  ob¬ 
tained  by  applying,  say,  modulations  and  translations  to  a  fixed  function,  are  also  of  interest. 
For  example,  systems  of  the  form 
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(6.1)  g,nn(x)  =  e^^^^^g{x-n), 

m,neZ,  the  Gabor  systems,  have  been  studied  extensively.  A  very  general  context  that 
produces  the  “continuous”  versions  of  such  systems  (as  well  as  the  continuous  wavelets)  can 
be  described  by  the  collection  of  (n  +  2)  x  (n  +  2)  matrices  of  the  form 


/I  6 

(6.2)  S  =  0  /I  c  , 

Vo  0  l) 

where  b  is  an  n-dimensional  row  vector,  c  an  n-dimensional  column  vector,  A  e  GL{n,  M)  and 
zeM.  If  we  let  the  matrix  (6.2)  act  on  the  column  vector  (on  the  right) 


where  u,yeR  and  ?;  6 E" ,  we  obtain  the  action 


/  u  \  /  u  +  b‘V-\-  zy 

I  ^  I  — ^  1  +  cy 

\y  J  \  y 


This  induces  the  following  mapping 


{T,  v,  2/)  =  I  det  +  cy) 

defined  on  a  function  If  u  =  0,6  =  0  and  t/  =  1,  we  obtain  the  map  (2.1) 

(strictly  speaking,  we  should  use  g~^  instead  of  g\  the  variable  2:  is  not  important  and  its 
main  function  is  to  make  sure  that  the  matrices  (6.2)  form  a  group).  If  A  =  7,  y  =  1, «  =  0 
we  obtain  the  “continuous  Gabor  system,”  which  is  also  referred  to  as  the  Weyl-Heisenberg 
transform.  It  is  natural  to  consider  the  various  themes  treated  in  the  previous  questions  in  this 
general  setting.  For  example,  we  can  try  to  find  admissibility  conditions,  such  as  Theorem 
(2.1),  when  A  belongs  to  a  subgroup  of  GL{n,  E)  and  6,  c  c  E^ .  In  the  Weyl-Heisenberg  case 
this  is  considered  in  [LP];  see,  also,  the  discussion  in  [DGM]  that  is  relevant  to  this  case.  The 
authors  of  [LWWW]  are  engaged  in  an  investigation  of  these  problems  in  the  general  case. 
Particularly  challenging  is  the  question  of  the  discretizations  of  the  continuous  transforms. 
The  Gabor  system 
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is  a  particular  discretization  of  this  type  that  is  well  known;  various  conditions  guaranteeing 
the  orthonormality  and  frame  properties  of  these  functions  have  been  considered  by  many 
authors  (see,  in  particular,  [Cz]). 

We  have  included  some  items  in  the  bibliography  that  are  not  referred  to  directly.  In 
general  these  pertain  to  presentations  of  continuous  wavelets  and  their  discretizations  that 
should  be  compared  with  our  presentation;  there  are,  also,  certain  aspects  of  what  we  describe 
here  that  are,  formally,  quite  similar  to  work  done  in  representations  theory.  We  suggest  that 
the  reader  examine  [BGZ],  [BT],  [Car],  [DM],  [GM],  [GMPJ,  [GMP2],  [GP],  [K],  [LP],  [M], 
and  [ST]  for  the  work  done  that  is  related  to  section  2. 

One  last  word  about  the  “characterization”  equalities  we  talked  about.  Clearly  it  is  a  good 
thing  to  find  out  descriptions  of  the  class  of  all  functions  having  certain  properties  (wavelets, 
scaling  functions,  tight  frame  wavelets,  low  pass  filters,  etc.)  We  claimed,  at  the  end  of  section 
1,  that  the  equalities  we  are  considering  have  been  very  useful  for  many  constructions.  The 
book  [HW]  presents  many  examples.  We  cite  [BGRW]  as  another  example  that  attracted 
some  attention. 
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Part  2 


Problems 


Assorted  Problems 


Various  Authors 

1.  Preservation  of  bandlimitedness  under  non-afbne  time  warping  for 
multi-dimensional  functions 

S.  AzizU  McDonald,  and  D.  Cochran,  Arizona  State  University 

If  /  €  has  compactly  supported  Fourier  transform  and  7  :  has  the 

form  7(x)  =  Ax -\-h  where  A  e  GLd{R)  and  6  e  R'^,  then  _  /  o  7  also  has  compactly 
supported  Fourier  transform.  It  is  possible  to  construct  specific  /  and  7  so  that  /  and  /  o  7 
both  have  compactly  supported  Fourier  transforms,  /  is  not  the  zero  function,  and  7  is  a 
continuous  and  invertible  function  that  is  not  of  the  affine  form  just  given.  This  construction 
can  be  accomplished,  for  example,  by  considering  a  d-fold  cartesian  product  of  known  one¬ 
dimensional  examples  such  as  described  in  [1]. 

Under  the  assumption  that  7  :  R^  -)•  R^  is  continuous  and  invertible,  is  it  true  that  /  o  7 
has  compactly  supported  Fourier  transform  for  all  /  G  with  compactly  supported 

Fourier  transform  if  and  only  if  7(0:)  =  Arc  +  6? 

[1]  D.  Cochran,  S.  Azizi,  and  J.N.  McDonald,  “Harmonic  analysis  and  sampling  in  warped 
spaces,”  in  20th  Century  Harmonic  Analysis  ~  A  Celebration,  J.S.  Byrnes,  ed. 

2.  Window  pairs  that  determine  spectral  phase 

S.  Shetty,  J.N.  McDonald,  and  D.  Cochran,  Arizona  State  University 
Consider  a  finite  sequence 

(2.1)  a(0),  a(l),  ...,  a{n) 
of  complex  numbers  and  the  associated  polynomial 

(2.2)  p{z)  =  a(0)  +  a{l)z  H - h  a{n)z^. 

The  spectral  magnitude  of  the  sequence  (2.1)  or  its  associated  polynomial  (2.2)  is  the  function 
s  :  R  ^  C  defined  by  s{uj)  —  |p(e*‘^)|.  In  general,  it  is  clear  that  two  different  polynomials 
can  have  identical  spectral  magnitude  functions.  If,  however,  two  polynomials  p{z)  and  q{z) 
have  identical  spectral  magnitude  functions  and  their  respective  derivatives  p'{z)  and  q’[z) 
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also  have  identical  spectral  magnitude  functions  (i.e,,  \p'{z)\  ~  \q'{z)  \  for  all  z  with  \z\  =  1), 
then  it  can  be  shown  that  p{z)  =  cq(z)  for  some  complex  constant  c  with  \c\  =  l. 

Define  finite  sequences  Wi{k)  =  1  and  u;2(^)  =  k  for  k  =  0,  ...,n  and  consider  the 
sequences  bi{k)  —  a{k)wi(k)  and  62 (^)  ==  a{k)w2{k)  obtained  by  “windowing”  the  original 
sequence  a(0), a(n)  with  each  of  these  new  sequences.  The  polynomial  associated  with 
^2(0),...,  b2{n)  is  r(z)  =  zp\z),  which  has  spectral  magnitude  function  identical  to  that  of 
■^(z).  Hence,  the  spectral  magnitude  functions  of  the  two  windowed  sequences  are  sufficient 
to  determine  p{z)  up  to  a  unimodular  constant  factor. 

The  problem  posed  here  is  to  characterize  all  pairs  of  window  sequences  u;i(0),  ..,wi{n) 
and  u;2(0),  ...W2(n)  with  the  property  that  the  spectral  magnitude  functions  of  the  sequences 
bi{k)  =  a(k)wi(k)  and  62 (A:)  =  a{k)w2{k)  are  sufficient  to  determine  any  polynomial  p{z) 
of  degree  n  up  to  a  unimodular  constant  factor.  The  corresponding  problem  with  spectral 
phase  in  place  of  spectral  magnitude  is  also  of  interest. 


3.  Questions  on  Riesz  Products 

J.  R  Kahane,  University  Paris-Sud 

Riesz  products,  identified  with  positive  measures,  are  of  the  form 

00 

n=:l 

n=l 


where  (A„)  is  a  given  Hadamard  sequence  (Ai  >  0,  >  q  >  1\  a  =  {a„}  and  b  —  {6n}, 

|a„|  <  1  and  \bn\  <  1  for  all  n. 


3.1.  Question  1 


Give  a  necessary  and  sufficient  condition  on  (a,  6)  for  pa  ^  Pb- 
Partial  answers  are: 


Er  \an-bn?<<X> 
sup  |a„|  <  1 


Pa  ^  Pb 

Er 


(Peyziere,  Brown-Moran) 

-  <  CXD  ^ 


|®nl  l^n| 


Pa  ^  Pb 


(Parreau,  Ann.  Inst.  Fourier  40,  2  (1990),  391-405). 
Let  us  write 

00 

n=l 
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3.2.  Question  2 


fla  ^ 

Fan  Ai-Lua  is  the  author  of  this  question.  A  positive  answer  would  also  answer  question 
1,  using  the  following  theorem  of  Klimer  and  Saeki  (Ann.  Inst.  Fourier  38, 2  (1988),  63-93) 
(c/.  also  Fan,  StudiaMath.  (1991)  249-266). 

If  the  (pn  are  random  with  the  usual  probability. 


in  =  arg(a„  +  6„) 
Sn  =  arg(an  -  6^) 


Motivations  and  hints  can  be  found  in  the  article  of  Fan  in  Bull.  Sc.  Math.  2®  s.,  117 
(1993),  421-439. 


4,  Completeness  of  sets  of  complex  exponentials  in  convex  sets;  open  problems 

B.N.  Khabibullin,  Department  of  Mathematics,  Frunze  str  32,  Bashkir  State  University,  Ufa, 
Bashkortostan,  450074,  Russia, 

algeoin©bsu .  bushedu .  ru  and/or  khabib-bulat@mail .  ru 

Let  A  =  {An},  n  G  N,  be  a  sequence  of  pairwise  different  complex  numbers  (points)  in 
the  complex  plane  C,  0  ^  A,  and  An  ^  oo  as  n  -Hoo. 

Let  A  be  a  convex  open  or  closed  set  on  C .  By  if  (X)  denote  the  space  of  all  continuous 
complex-valued  functions  on  X  which  are  holomorpic  in  the  interior  Int  X  of  X  (if  Int  X  ^ 
0)  with  the  topology  of  uniform  convergence  on  compact  subsets  of  X.  An  exponential  system 
Exp  A  =  {exp(An^)}  is  complete  in  X  if  the  closure  of  the  linear  span  of  Exp  A  in  if(X) 
coincides  with  H{X). 

The  completeness  problem  for  a  set  X  is  to  determine  all  such  sequences  of  exponents 
A  for  which  the  system  Exp  A  is  complete  in  X.  On  the  background  of  known  results  it  is 
natural  to  look  for  the  solution  of  the  problem  in  terms  of  the  density  of  the  distribution  of 
the  points  of  A  and  in  terms  of  the  geometric  characteristics  of  X.  It  is  natural  to  subdivide 
the  completeness  problem  into  six  cases  according  to  the  type  of  X : 

1)  X  =  [-0,  a]  is  a  segment  on  real  axis  E,  a  >  0,  and  H(X)  =  C[-a,  a]; 

2)  X  =  (—a,  a),  where  a  G  (0,  +oo],  or  X  =  [0,  +oo),  or  X  =  (— oo,  0]; 

3)  X  =  G  is  an  unbounded  convex  domain  in  C,  [0,  +oo)  C  G\ 

4)  X  =  Clos  G,  where  G  is  the  same  as  above,  Clos  G  is  the  closure  of  G', 

5)  X  =  (?  is  a  bounded  convex  domain  in  C; 

6)  X  —  K,  where  X  is  a  compact  convex  set  in  C,  Int  X  ^  0; 

Below,  every  item  (k-!)  or  (k-?)  is  a  continuation  of  item  k),  and  (k-!)  (respectively  (k-?)) 
denotes  a  solved  (respectively  unsolved)  problem. 
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(1-?)  No  precise  condition  is  known  for  the  completeness  of  systems  Exp  A  in  [-a,  a],  and 
finding  such  a  condition  is  a  very  difficult  problem.  In  this  case  we  present  a  test  for  the 
completeness  of  systems  Exp  A  in  [-a,  a]  to  within  a  single  exponential  (see  [5,  Theorem  7] 
together  with  recent  results  of  B.  Cole  and  T,  Ransford  [2]). 

Here  and  below  we  denote  by  Q  the  class  of  all  extended  Green  functions  5£>(C,  0)  for  the 
point  0  and  arbitrary  bounded  regular  (for  the  Dirichlet  problem)  domains  Z)  c  C,  0  G  D, 
go{(,0)  =  0forC^D. 


Theorem  1.  If 

sup 

equals  +oo,  then  the  system  Exp  A  is  complete  in  [—a,  a].  Conversely,  if  this  quantity  is 
bounded  above,  then  by  deleting  a  single  (arbitrary)  exponential  from  the  system  Exp  A  we 
obtain  a  system  that  is  not  complete  in  [—a,  a]. 


(2-!)  In  this  case  the  completeness  problem  was  solved  completely  by  the  profound  Beurling- 
Malliavin  theorem  on  the  radius  of  completeness  [1]. 

(3-!)  When  X  =  G  is  an  unbounded  convex  domain  in  C,  the  completeness  problem  was 
solved  completely  in  [3,  Theorem  2]. 

(4-?)  The  width  of  an  unbounded  convex  domain  G,  [0,  +oo)  C  G,  is  defined  to  be  the 
quantity  da  —  lim^-^+oo  where  dcix)  =  supjlz  -  z'\  \  z,z'  e  G,  ^z  =  ^z'  =  x}, 
X  G  [0, +00). 

If  either  do  =  +00  or,  for  all  a;  G  E  we  have  d^x)  ^  dg  <  +00,  then  sharp  conditions 
of  completeness  of  Exp  A  in  Clos  G  are  the  same  as  in  (3-!).  In  the  opposite  case,  there  is  a 
test  for  the  completeness  of  a  system  Exp  A  in  Clos  G  only  under  conditions  that  the  sequence 
A  is  separated  from  the  imaginary  axis  (  see  [4,  Corollary  4.2]): 


Theorem  2.  Let  G  be  an  unbounded  convex  domain  of  width  2'Kd  <  +00,  [0,  +00)  C  G, 
and  suppose  there  exists  one  value  g/j:  G  E  such  that  dcix)  —  do-  Suppose  that  a  sequence 
A  satisfies  the  condition  |3RA„|  ^  (5|A„|,  n  ^  no,  for  a  certain  number  5  >  The  system 
Exp  A  is  complete  in  Clos  G  if  and  only  if 


sup 

l<r<R<+oo 


max 


{ ^  ■ 

v  ■r^lAn|<R 

9?A„<0 


5R- 


E 


r^lA„i<R 

3?A„>0 


=  +CXD. 


(5-?)  At  present  the  completeness  problem  has  not  been  solved  satisfactorily  for  any  bounded 
convex  domain  G,  In  our  article  [5,  Theorem  5]  we  give  a  complete  solution  of  the  complete¬ 
ness  problem  in  a  bounded  convex  domain  G  in  terms  of  so-called  Jensen  functions.  After  [2] 
this  result  can  be  formulated  in  the  following  way. 
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Let  G*  be  symmetric  to  G  with  respect  to  E,  and  let  sg{0)  be  the  arc  length  of  the 
boundary  of  Clos  G*  from  a  given  point  to  the  next  (counter-clockwise)  point  of  contact  of 
the  supporting  line  that  is  orthogonal  to  the  direction  6. 

Let  g  E  Q,  The  function  kg{9)  =  g{te^^)  dt  is  called  the  indicator  of  g.  Always  the 

indicator  kg  is  the  support  function  of  a  convex  compact  set  Kg  such  that  0  £  Int  Kg  ^  0. 
We  s&t  S {Kg,  G)  =  I  kg{6)  dsoiO) .  This  quantity  is  the  mixed  area  of  the  convex  sets 
Kg  and  G*. 

Theorem  3.  The  system  is  complete  in  the  convex  hounded  domain  G  if  and  only  if 
infrf  ^  1,  where  the  infimum  is  taken  over  the  numbers  dfor  which  sup^^^  — 

^S{Kg,G))  <+00. 

In  this  theorem,  the  class  Q  of  Green  functions  plays  the  role  of  test  functions.  However, 
the  class  Q  is  too  wide.  Some  more  transparent  sufficient  conditions  can  be  extracted  from 
the  Theorem  3.  For  example,  they  are  in  [6,  Theorem  A].  It  is  possible  that  these  conditions 
are  necessary. 

(6-?)  No  precise  condition  is  known  for  the  completeness  of  systems  Exp  A  in  K,  The 
completeness  problem  in  K  is  apparently  a  very  difficult  problem  (cf.  with  X  =  [-a,  a]). 
We  present  a  test  for  the  completeness  of  systems  Exp  A  in  /i"  to  within  two  exponentials  ( 
see  [5,  §  7,  3.]  together  with  [2]). 

Theorem  4.  If  the  quantity  sup^^g  -  ~S{Kg,K)^  equals  H-oo,  then  the 

system  Exp  A  is  complete  in  K.  Conversely,  if  this  quantity  is  bounded  above  by  a  finite 
number,  then  by  deleting  two  (arbitrary)  exponentials  from  the  system  Exp  A  we  obtain  a 
system  that  is  not  complete  in  K, 
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5.  Density  of  Domain  of  the  Weighted  Hilbert  Transform 

Alexander  Kheifets,  khe  i  f  e  t  s  @ina  il.ru 


5.1.  Problem  1 

Let  A  be  a  nonnegative  measurable  function  on  the  unit  circle  T.  Assume  that  ^ 
and  for  definiteness  ||^||i  =  1.  In  this  case  /  €  ^^(A)  implies  /  e  which  allows  one  to 
define  the  Hilbert  transform  Hf  for  /  e  L^(  A)  : 


S{T)dT 

T-t+ 


27ri  Jrp  T  —  t- 


t  G  T, 


where  the  first  and  the  second  integrals  are  understood  as  the  radial  limits  from  inside  and 
outside  of  the  unit  disk  D  respectively.  But  Hf  need  not  be  in  L^(A).  We  say  that  /  is  in 
the  domain  of  H  on  L^(A)  if  both  /  and  Hf  are  in  L^(A).  Assume  that  the  domain  of  H  is 
dense  in  L^(A).  The  question  is:  can  this  property  be  characterized  in  terms  of  A? 

One  can  show  that  if  ^  =  |p|,  where  g  is  in  the  Hardy  space  then  the  domain  of  H 
is  dense  in  L^(A)  if  and  only  if  g  is  an  exposed  point  of  the  unit  ball  in  the  space  H\,  A 
point  6o  in  a  complex  or  real  Banach  space  B  is  said  to  be  an  exposed  point  of  the  unit  ball  if 
||6o||  =  1  and  there  exists  a  real  continuous  linear  functional  F  on  jB  such  that  F(bo)  =  1  and 
F(b)  <  1  for  all  other  b  in  the  unit  ball.  Regarding  extreme  and  exposed  points  of  the  unit 
ball  in  i/1  see  [1],  [2],  [3]  and  [4]. 

Obviously  integrability  of  A  is  a  sufficient  condition  for  density  of  the  domain  but  it  is 
not  a  necessary  one.  The  interesting  case  is  the  one  when  A  is  not  integrable. 


5.2.  Problem? 

Let  u;  be  a  measurable  unimodular  function  on  the  unit  circle  T:  \w{t)\  =  1  for  a.a.  t  gT. 
Assume  that  set  P+w\Hl  is  dense  in  i/^,  where  Hi  is  the  Hardy  space  of  analytic  functions 
and  is  the  orthogonal  projection  from  onto  i/^.  The  question  is:  how  this  property  can 
be  characterized  in  terms  of  w? 

One  can  show  that  the  property  holds  true  if  and  only  if  u;  is  a  canonical  solution  to 
a  Nehari  problem.  The  Nehari  problem  consists  in  finding  all  the  functions  w  bounded  in 
modulo  by  1  on  the  unit  circle  T:  |iy (<)  |  <  1  for  a.a,  tGT,  with  given  where  P_  is  the 
orthogonal  projection  from  onto  i/^.  Details  regarding  the  Nehari  problem  can  be  found 
in  [5],  [6],  [7],  [8],  [1],  [4]. 

It  is  also  known  that  it;  is  a  canonical  solution  of  a  Nehari  problem  if  and  only  if  it  is 
of  the  form  =  ]f[ ?  where  g  is  an  exposed  point  of  the  unit  ball  in  i/^.  In  this  case  the 
representation  is  unique.  See  [2],  [4] . 
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6.  Harmonic  Sliding  Analysis  Problem 

Vladimir  Ya.Krakovsky,  UNESCO/IIP  International  Research-Training  Centre  for  Informa¬ 
tion  Technologies  and  Systems,  40  Academician  Hlushkov  Ave,  Kyiv  03680  Ukraine, 
krakovsk@uasoiro . f reenet . kiev . ua 

Harmonic  sliding  analysis  (HSA)  is  a  dynamic  spectrum  analysis  [1]  in  which  the  next 
analysis  interval  differs  from  the  previous  one  by  including  the  next  signal  sample  and  exclud¬ 
ing  the  first  one  from  the  previous  analysis  interval.  Such  a  harmonic  analysis  is  necessary 
for  time-frequency  localization  [2]  of  analysed  signals  with  given  peculiarities.  Using  the 
well-known  Fast  Fourier  transform  (FFT)  is  not  effective  in  this  context.  More  effective  are 
known  recursive  algorithms  which  use  only  one  complex  multiplication  for  computing  one 
harmonic  during  each  analysis  interval. 

To  yield  an  instant  spectrum 

(6.1)  F,ip)  =  lr  E  9  =  0,1,2,  ... 

^  k^q-N+l 

it  is  possible  to  use  a  simple  recursive  algorithm,  described  in  [3],  [4]: 

(6.2)  F,(p)  =  F,_i(p)  +  l[/(9)  -  f{q  -  N)]W^’’\  p  €  O^P^,  «  =  0, 1, 2 . 

This  algorithm  has  a  remarkable  peculiarity  which  permits  one  to  organize  HSA  so  that 
one  complex  multiplication  may  be  used  for  computing  two,  four  and  even  eight  (for  complex 
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signals)  spectrum  harmonics  at  once  [5],  [6],  [7],  [8].  This  may  be  done  as  follows.  Let 
algorithm  (2)  be  presented  as  follows: 

(6.3)  F,{p)  =  F,-i(p)  +  AF,(p),  p  e  0,  P  -  1,  g  =  0, 1, 2,  . . .  , 

1  Ott 

(6.4)  AP,(p)  =  jylf(g)  -  f(q  -  Af)]  exp[-j— pg]. 

The  spectrum  increments  AFfj(p)  may  be  used  not  only  for  the  spectrum  harmonic  p,  but 
for  spectrum  harmonics 

(6.5)  Pi  =  iiV/4 +  p,  ^  e  1,3 
and 

(6.6)  pk  =  kN/i  -  p,  A:  €  1, 4 

as  well,  using  known  properties  of  the  complex  exponential  function.  In  a  summarized  (and 
simplified)  view  the  algorithm  (3)  modification  may  be  presented  as  follows: 

a)  for  spectrum  harmonics  (5) 

(6.7)  P,(pi)  =  P,-i(Pi)  +  (-j)'''AP,(p),  g  =  0, 1, 2,  ...  , 

b)  for  spectrum  harmonics  (6) 

(6.8)  P,(p*)  =  P,-i(p;t)  +  g  =  0, 1,2,  . . .  , 

where  AFq(—p)  are  complex  conjugated  spectrum  increments  AFq{p),  if  the  signal  samples 
are  real,  and  if  the  signal  samples  are  complex,  AFq{—p)  are  generated  by  inverting  the  signs 
of  the  products  of  the  signal  increments  Afq  =  ^[f{q)  —  f{q  —  A^)]  with  the  imaginary  part 
of  the  weighting  function  and  then  forming  the  appropriate  algebraic  sums. 

In  such  a  way  it  is  possible  to  use  one  complex  multiplier  for  computing  up  to  four 
harmonics  for  a  real  signal,  and  up  to  eight  harmonics  for  a  complex  signal,  simultaneously. 

We  can  now  state  the  HSA  problem:  Is  it  possible  to  double  the  speed  of  the  response  by 
using  an  additional  multiplier?  If  so,  how? 
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7.  Questions  on  Kazhdan’s  Property  (T)  on  Hypergroups 

Liliana  Pavel,  University  of  Bucharest,  Faculty  of  Mathematics,  Academiei  14,  70109  Bu¬ 
charest,  Romania,  lpavel©©lan .  unibuc .  ro 

Isolated  points  of  the  dual  of  a  locally  compact  group  (endowed  with  the  hull-kernel 
topology)  were  first  discussed  by  Diximier  [1].  Kazhdan  [5]  discovered  there  exists  a  clear 
connection  between  the  fact  that  the  class  of  the  one  dimensional  trivial  representation  is  an 
isolated  point  in  the  dual  of  a  locally  compact  group  (in  this  case  the  group  is  said  to  have 
property  (T))  and  many  interesting  group  properties. 

In  [6]  we  have  initiated  the  study  of  property  (T)  on  hypergroups  (with  Haar  measure), 
obtaining  only  some  introductory  results.  We  have  given  an  appropriate  definition  of  property 
(T)  for  hypergroups:  this  definition  is  an  extension  of  the  corresponding  one  for  locally 
compact  groups  of  [3].  Consequently,  we  say  that  a  hypergroup  K  has  property  (T)  if  each 
continuous  representation  of  K  which  has  almost  invariant  vectors  has  also  invariant  vectors. 
(We  recall  that  a  representation  tt  of  iiT  on  the  Hilbert  space  has  almost  invariant  vectors  if, 
for  any  e  >  0  and  C  C  K,C  compact,  there  exists  a  with  ||a||  —  1  such  that  llTr^a  - 
a\\  <  £,  Vx  €  C7;  TT  has  invariant  vectors  if  there  exists  b  e  with  ||6||  =  1,  such  that 
TTxb  =  6,  Vx  G  K.) 

We  note  that  in  the  particular  case  when  K  is  sl  locally  compact  group  this  definition 
is  equivalent  with  the  one  given  by  Kazhdan  [5].  For  arbitrary  hypergroups,  we  have  only 
obtained  that  if  iiT  is  a  hypergroup  with  property  (T),  then  the  class  of  the  one  dimensional 
trivial  representation  is  an  isolated  point  in  the  dual  of  K  { [6,  Theorem  2]),  so  we  ask  if  the 
converse  is  still  valid. 

It  is  well  known  that  any  amenable  non-compact  locally  compact  group  has  no  property 
(T) .  This  is  an  immediate  consequence  of  the  fact  that  a  locally  compact  group  G  is  amenable 
if  and  only  if  it  satisfies  Reiter’s  condition  (P2):  Ve  >  Q,WC  C  G,C  compact,  3/  G  L2{G), 
f  ^  0,  II/II2  =  1,  such  that  ||5x  *  /  “  /II2  <  Va;  G  C,  or,  equivalently,  the  left  regular 
representation  Xq  of  G  on  L2{G)  has  almost  invariant  vectors;  Xq  obviously  has  no  invariant 
vectors  if  G  is  not  compact. 

For  non-compact  hypergroups  (with  Haar  measure),  the  amenability  is  not  equivalent  to 
{P2),  In  [7,  Example  4.6]  an  example  is  given  of  a  non-compact  amenable  hypergroup  that 
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does  not  satify  (P2);  it  is  also  proved  that  (P2)  implies  the  amenability  ( [7,  Theorem  4.1]). 
Consequently,  for  hypergroups,  we  can  obtain,  with  similar  arguments  as  for  locally  compact 
groups,  that  any  non-compact  hypergroup  K  satisfying  condition  (F2)  has  no  property  (T). 

For  example,  if  PT  is  a  non-compact  amenable  hypergroup  with  a  supernormal  subhyper¬ 
group,  or  if  ii"  is  a  non-compact  commutative  hypergroup  with  the  Plancherel  measure  on  the 
dual  such  that  its  support  contains  the  trivial  character,  then  K  satisfies  (P2)  ( [7,  Theorem 
4.7,  Lemma  4.5]).  In  these  cases,  K  has  no  property  (T). 

We  can  not  yet  give  a  general  answer  concerning  the  relation  between  amenability  and 
property  (T)  for  the  hypergroups  case.  Thus,  we  close  with  the  question:  Does  there  exist  an 
amenable  non-compact  hypergroup  having  property  (T)  (?). 
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8.  Optimal  Speech  Signal  Partition  into  One-Quasiperiodical  Segments 

Taras  K.  Yintsiuk,  UNESCO/IIP  International  Research-Training  Centre  for  Information 
Technologies  and  Systems,  Kyjiv  03680  Ukraine, 
vintsiuk@uasoiro . f reenet . kiev.ua 


8.1.  Introduction 

It  is  well  known  that  analysis  of  such  complicated  signals  as  speech  signals  has  to  be 
carried  out  synchronically  with  a  current  pitch  period  (quasiperiod).  Besides,  for  speech 
signal  it  is  important  to  find  current  one-quasiperiod  segment  duration,  beginnings  and  ends 
as  well. 

To  solve  this  problem  quasi-periodicity  and  non-periodicity  signal  models  are  proposed. 
Each  hypothetical  one-quasiperiodical  signal  segment  is  considered  as  a  random  distortion  of 
previous  or  following  one  taken  with  the  unknown  multiplying  factor.  The  problem  of  optimal 
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current  pitch  period  discrimination  and  speech  signal  partition  into  quasiperiodical  and  non¬ 
periodical  segments  consists  in  1)  the  finding  the  best  quasiperiod  beginnings  or  the  one- 
quasiperiod  segments  under  restrictions  on  both  value  and  changing  of  current  quasiperiod 
duration  and  multiplying  factor  and  2)  the  association  of  optimal  one-quasiperiod  segment 
signals  into  large  quasiperiodic  and  non-periodic  segments.  For  this  problem  solving  an 
effective  algorithm  based  on  dynamic  programming  have  to  be  proposed. 


8.2.  One-Quasiperiodicity  Models 


Let  the  signal  fn,n  =  l :  Nhe  observed  where  /„  is  a  signal  value  at  the  discrete  uniform 
time  nAt  with  step  At,  for  example  At  =  50  ps  for  speech  signal.  If  the  (s  +  l)-th  one- 
quasiperiod  signal  segment  beginning  is  denoted  by  Ug ,  then  (n^  —  1)  will  be  the  end  of  the  s- 
th  one-quasiperiod.  Further  segment  signal  j  =  0  :  {Tg  —  1) ,  Tg  =  Ug-  rig-i  will 

be  called  5-th  one-quasiperiod  segment  with  duration  Tg  ,  if  it  is  approximated  sufficiently 
well  by  neighbouring  ones,  (s  -  l)-th  or  (s  +  l)-th,  which  respectively  precedes  or  follows 
the  s-th  one-quasiperiod  segment.  The  latter  is  taken  with  the  unknown  multiplying  number 
a7ora+: 


(8.1) 


f  ^3  fna~2+jl  i  —  0  *  1)  j 

\  0,  i  =  min(r„r,_i)  : 


/o  f+  _  I  fna+j,  j  =  0  :  (min  (T,,  T,+i)  -  1) ; 

j  =  mm(T3,T3+i):(T3-l). 

Let  us  introduce  a  priori  restrictions  for  multiplying  number  value  a  and  both  current 
quasiperiod  duration  value  Tg  and  its  changing  Ag  =Ts-  Tg^i. 

(8.3)  {ois  :  0  ^  CKjnin  ^  ^  ^max}  =  ^min  ^  ^  ^max)  j^s|  ^  ^max- 

Let  us  fix  the  elementary  quasi-periodicity  (EQP)  measure  for  the  5-th  one-quasiperiodic 
signal  segment  =  fn.-T.+j,  j  =  0:{T,-  1)  as: 

d  ((n„  T3) ,  Aj)  =  min  ^2  {fn.-i+j  -  fna-,+j}  = 

j=0 


a£A 


mm(rs-As  jTa)-! 


Ts-1 


V  \  /  \  2  V ■  "A  rt 

(8.4)  =min  -  «/„,-2r,+A7+i)  +  X/ 


+3 


3=0 


i=min{Ta-As  ,Ts 


or 


|A+|^AmaxQ;^_4  ,  X  : 

d  ((n*,  T,) ,  A+)  =  min  min  2^  +j  -  /.t-i+j  j 

3=0 
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(8.5) 


0^A+^A.nax 

=  min 


min 


min(Ta-A+,Ta)-l 

{fns-Tt,+j  ~~  Oifn+j) 

j^O 

Ts~l 

+  E  fl,-T,^ri 

j=min(Ta-A+,Ts) 


+ 


\ 

/ 


As  it  is  followed  from  the  expression  (8.4-8.5)  the  signal  segment 


fns-\+3  ~  fria-Ta+ji  3  ~  ^  •  {'^s  l) 

is  tested  on  quasiperiodicity  by  comparison  with  previous  segment 

fns-2Ta+Ai+j^  3  ~  ^  ’  {^s  —  ~  l) 


and  all  possible  following  ones  /n^+j,  j  =  0  :  (T^  -  -  1) ,  jA"^!  ^  A^ax  but  only  the 

best  comparison  result  is  associated  with  the  quasiperiodicity  measure  value  ((n^,  Tg) ,  A^). 

Any  permissible  variant  {{ris.Ts) ,  Af) ,  s  ==  0, 1, 2, P  of  the  signal  n  =  1  :  iV 
segmentation  on  P  one-quasiperiodic  segments  under  restrictions  (8.3)  is  characterised  by  the 
sum  of  respective  EQP  measure  values: 


p 

(8.6)  G(n5,  s  =  0  :  P)  =  A*) 

To  find  for  the  signal  /„,  n  —  1  :  A  the  best  segmentation  onto  unknown  number  P 
one-quasiperiod  segments  it  is  necessary  to  minimise  the  criteria  (8.6)  on  all  permissible 
sequences  n^,  s  =  0  :  P. 


8.3.  Problems  to  he  Solved 

The  following  problems  have  to  be  solved. 

1.  To  propose  effective  dynamic  programming  algorithm  for  optimal  partition  of  signal  into 
one-quasiperiodical  segments. 

2.  To  determine  ways  how  to  unite  one-quasiperiodical  segments  into  large  quasiperiodic  or 
non-periodic  ones. 

3.  For  each  time  n  to  formulate  some  necessary  and  sufficient  conditions  that  a  certain 
pair  (n*,T*) ,  n*  <  n  is  an  optimal  one-quasiperiodic  segment  beginning  n*  and  duration 
T*  independently  on  future  signal  after  n. 
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9.  A  question  of  “complexity” 

J.  A.  Ward,  Murdoch  University 

This  problem  arises  in  discrete-time  worst-case  system  identification  [9]. 

In  time-domain  identification  we  wish  to  identify  the  impulse  response  /i  of  a  time- 
invariant  linear  system  within  a  prescribed  tolerance,  using  a  finite  number  of  sampled  values 
of  the  output  y  corresponding  to  a  chosen  input  signal  u.  The  output  signal  y  is  the  sum  of 
the  system  response  u^h  and  a  noise  term  ??,  so  that  y  =  u*h  +  ri.  Both  u  and  h  are  bounded 
real  or  complex  valued  one-sided  sequences,  and  *  denotes  the  usual  convolution.  It  is  as¬ 
sumed  that  h  belongs  to  some  specified  model  set,  such  as  one  of  the  polynomial  sequence 
sets  Vn,  or  the  sets  V(p,r)  that  are  associated  with  linear  systems  whose  transfer  functions  H 
are  rational  functions  and  whose  poles  (if  any)  occur  outside  the  unit  circle.  Here  H  is  the  z- 
transform  h  of  the  impulse  response.  In  frequency-domain  identification  the  aim  is  to  identify 
H  using  a  finite  number  of  noisy  sample  values  of  H  on  the  unit  circle.  The  sample  values 
are  obtained  by  measuring  the  system’s  response  to  a  sinusoidal  input.  It  is  also  assumed  that 
H  belongs  to  some  prescribed  model  set,  such  as  the  disc  algebra  A(D),  although  additional 
conditions  may  be  placed  on  H, 

In  worst-case  identification  the  noise  term  q  is  simply  assumed  to  be  uniformly  bounded 
IMloo  ^  <5,  where  ||77l|oo  denotes  the  supremum  norm.  The  and  -norms  are 
most  often  used  to  measure  the  accuracy  of  model  estimates  (tolerance). 

Both  time-  and  frequency-domain  worst-case  identification  problems  can  be  recast  within 
a  more  abstract  framework  [7],  [2].  Suppose  that  A’  is  a  normed  linear  space, 


(Pl^<P2y  .  .  .  ,  (fN,  ■  •  • 


a  uniformly  bounded  sequence  of  continuous  linear  functionals  on  and  that  r  >  0.  Sup¬ 
pose  also  that 

Vk  =  (Pk{h)  -f-  qk  and  |7?fc|  <  for  each  1  <  A;  <  n, 


where  h  is  an  unknown  element  of  a  given  subset  MoiX.  Then,  using  2/i,  2/2j  •••»  X/n?  we  want 


to  find  hinMso  that 


h-h 


<  T.  If  we  can  do  this,  then  we  say  that  {<^i,  <^2?  •  •  •  ?  is 


a  ((5,  t) -identifying  set  for  M.  The  connection  with  time-domain  identification  is  that  each 
(fk{h)  =  (u  *  /i)fc,  where  u  is  the  chosen  input,  while  for  frequency-domain  identification 
(fkih)  =  H(zk)  —  h{zk)  where  zq,  zi, ...,  Zn-i  are  points  on  the  unit  circle. 

The  complexity  of  identification  in  M  for  a  given  tolerance  r  and  noise  bound  S  is  defined 
to  be  the  minimum  number  of  ‘observations’  or  functionals  required  to  obtain  an  estimate 
within  that  tolerance  for  the  given  noise  bound.  It  depends  on  M.  and  on  the  type  of  func¬ 
tionals  that  are  allowed.  Since  II/II2  ~  Il/llif2  ^  ll/lli/00  <  II /111  for  any  sequence  /,  the 
complexity  of  identification  in  M  decreases  as  we  replace  the  f  norm  on  A^,  by  the 
norm,  and  then  by  the  norm.  It  is  known  that  identification  is  typically  ‘exponential’  in 
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complexity,  and  so  not  very  practical  (see  [1],  for  example).  On  the  other  hand,  both  and 
f  identification  are  typically  ‘polynomial’  [3],  [5],  [8]. 

A  drawback  to  using  the  conservative  norm  to  measure  the  noise  component  is  that  a 
single  outlier  may  make  this  very  large,  and  this  in  turn  may  make  it  difficult  to  ‘identify’  the 
system  using  standard  techniques.  An  obvious  way  to  address  this  difficulty  is  to  replace  the 
norm  by  a  norm  that  is  less  sensitive  to  outliers.  More  generally  we  work  with  a  sequence 
(||||„)  of  seminorms  on  R”,  assuming  that  each  is  dominated  by  the  norm.  The  problem 
then  is  to  determine  the  complexity  for  different  choices  of  M  and  norms  on  the  noise. 

We  are  helped  in  this  task  by  the  observation  [3],  [2],  [7],  that  when  the  norm  is  used 
to  measure  the  noise,  then  linear  functionals  •  •  • ,  <^n  form  a  (^,  r) -identifying  set 

for  an  absolutely  convex  set  M  if  and  only  if 

max  \(pk{h)\  ^  6  for  each  h  e  Mr, 
where  Mr  =  {h  e  M  :  ||/i||  =  r}. 

A  related  result  [6]  for  the  generalised  noise  measure  case  is  that  there  is  a  ‘robustly 
convergent’  algorithm  for  estimating  h  on  the  basis  of  n  test  functional  values  if  and  only  if 
there  is  a  number  5  >  0  such  that 

lim  inf||(^i(/i),¥)2(ft):->¥>n(A))||„  >  e\\h\\  for  each  A. 

n— >oo 

This  is  relevant  because  it  can  be  shown  that  if  there  is  such  an  algorithm  then  for  any  noise 
bound  J  and  tolerance  level  r  there  is  a  finite  subset  of  the  </?a:S  that  form  a  ((5,  r) -identifying 
set  for  At. 

For  a  concrete  version  of  the  problem,  what  is  the  complexity  of  identification  if  we 
suppose  that  At  =  A(ID))  with  its  usual  norm,  and  that  ||||„  =  ||||^  for  all  n,  where  ||||^  is  an 
Orlicz  or  Lorenz  norm  on  A(P)?  Of  particular  interest  is  the  case  of  the  Lorenz  norm  given 
by  l|x|r  =  ||x||„  / 11(1, 1, 1)||„ ,  where 

n 

l|x|L=  sup  y]|3;„0)|u;jforeachx  =  e  R", 

and  E„  denotes  the  set  of  permutations  of  {1, 2, ...,  n},  and  where  w  =  (1, 1, ...,  1, 0,0, ..,  0), 
with  the  first  K  terms  equal  to  1. 

i  I 

References 

[1]  M.A.  Dahleh,  T.  Theodosopoulos  and  J.N.  Tsitsiklis,  The  sample  complexity  of  worst-case  identification  of 
FIR  linear  systems.  Systems  Control  Lett.,  20  (1993),  157-166. 

[2]  D.W.Hadwin,  K.J.Harrison  and  J.  A. Ward,  Worst-case  identification  of  linear  systems;  existence  and  com¬ 
plexity,  in  Proc.  Centre  for  Math,  and  its  Applications,  36  (1999),  53-65. 

[3]  Harrison,  K.J.,  J.R.  Partington  and  J. A.  Ward,  Complexity  of  identification  of  linear  systems  with  rational 
transfer  functions,  Math.  Control  Signals  Systems,  (1998) 

[4]  Harrison,  K.J.  and  J.A.  Ward,  Fractional  covers  for  convolution  products.  Result.  Math.,  30  (1996),  67-78. 


Various  Authors  /  Assorted  Problems 


383 


[5]  Harrison,  KJ.,  J.A.  Ward  and  D.K.  Gamble,  Sample  complexity  of  worst-case  -identification.  Systems 
Control  Lett.,  27  (1996),  255-260. 

[6]  Makila,  P.M,  and  J.R.  Partington,  Robustness  in  H°°  identification,  preprint  1999. 

[7]  Partington,  J.R.,  Interpolation  in  normed  spaces  from  values  of  linear  functionals.  Bull  London  Math.  Soc., 
26(1994),  165-170. 

[8]  Partington,  J.R.,  Worst-case  identification  in  linear  and  non-linear  algorithms.  Systems  Control  Lett.,  22 
(1994),  93-98. 

[9]  Partington,  J.R.,  Interpolation,  identification  and  sampling,  Oxford  University  Press,  1997. 


10.  Rate  of  decay  of  convolution  vs.  frequency  of  sign  changes 

H.S.  Shapiro 

All  references,  as  well  as  further  background  discussion  concerning  the  present  problem 
may  be  found  in  reference  [S]. 

Let  /  be  in  L°°(R+)  (the  class  of  real- valued  bounded  measurable  functions  on  (0,  oo)). 
Then,  the  following,  due  to  B.R  Logan,  Jr.,  is  known:  If 


(10.1)  y>0 

satisfies 


(10.2)  F(y)  =  0{e  as  y  oo 

for  some  a  >  0,  then  /,  the  even  extension  of  f  to  M.  (which  is  in  L°°(R))  has  spectrum 
disjoint  from  (-a,  a),  the  converse  is  also  true.  (The  spectrum  here  means  the  support  of  the 
distributional  Fourier  transform.)  As  an  illustration,  look  at  f(x)  =  cosax  (then,  f{x)  = 
cosox,  X  G  R).  Here  F{y)  is  the  Poisson  integral  of  cosax,  evaluated  at  a  point  of  the 
imaginary  axis,  so  F{y)  =  cos  ax.  Observe  that  in  this  case,  /(x)  has  an  essential  sign 
change  on  each  interval  in  R"^  of  length  greater  than  tt/o  (that  is,  on  each  such  interval  it 
assumes  positive  values,  as  well  as  negative  values,  on  a  set  of  positive  measure).  In  a  1965 
doctoral  thesis,  B.F.  Logan,  Jr.  raised  the  question  whether  (10.1)  implies  an  asymptotic  lower 
bound  for  the  amount  of  oscillation  of  /.  Namely,  defining  cr(x)  as  the  number  (possibly 
infinite)  of  points  of  (0,  x),  on  each  neighborhood  of  which  /  has  an  essential  sign  change, 
he  asked  whether 


(10.3) 


lim  in  / 
X  -4  OO 


X 


For  each  /  €  L°°(E),  not  identically  zero  and  satisfying  (10.2).  He  proved  the  answer  is 
affirmative  if  /  also  is  assumed  to  be  the  restriction  to  R+  of  an  entire  function  of  exponential 
type.  So  far  as  I  know,  no  one  has  proved  (or  disproved)  this  assertion  without  that  last 
hypothesis. 
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In  my  paper  [S]  I  remarked  that,  as  a  consequence  of  a  recent  theorem  due  to  Baouendi 
and  Rothschild,  the  weakening  of  (10.2)  to 

(10.4)  F(y)  =  0{y-^),  y  ^  oo 

for  every  positive  integer  n  implies  the  (weak)  oscillation  result;  /  has  an  essential  sign 
change  on  (6,  oo)  for  every  6  >  0. 

Certain  questions  now  almost  pose  themselves:  Suppose  F{y)  has,  as  ?/  — >  oo,  a  rate 
of  decrease  intermediate  between  (10.2)  and  (10.4)  (e.g.,  F{y)  =  0(e“^*)  for  some  t  with 
0  <  f  <  1.)  Can  one  assert  anything  about  the  frequency  of  sign  changes  of  /,  beyond  what 
already  follows  from  (10.4)? 

A  further  avenue  of  generalizations  appears  when  we  observe  that  (10.1)  is  a  convolution 
on  the  group  R+  (with  respect  to  the  Haar  measure  y)-  After  a  logarithmic  variable  change  it 
becomes  a  usual  convolution  on  R,  with  the  “kernel”  AT (x)  :=  (2/7r)(cosa:)~^  (details  in  [S]). 
Thus,  all  the  questions  we  have  raised  so  far  can  be  put  in  the  form:  Deduce  from  the  rate  of 
decay  at  +oo  of  gicK,  for  some  g  e  A°°(R),  lower  bounds  for  the  (asymptotic  frequency  of) 
essential  sign  changes  of  g.  Once  this  standpoint  is  taken,  one  can  raise  these  questions  for 
other  kernels,  like  K (x)  =  e~^  . 

One  final  remark:  It  is  a  corollary  of  the  result  that  (10.2)  implies  disjointness  of  spec¬ 
trum  /  from  (—a,  a),  that:  If  (10.2)  holds  for  every  a  >  0,  then  f  vanishes  a.e.  This  result 
predates  Logan’s  work,  and  is  a  special  case  (after  transformation  from  R+  to  R  by  the  loga¬ 
rithmic  variable  change)  of  general  results  due  to  I.I.  Hirschmann,  Jr.  from  1951.  We  might 
express  matters  thusly:  for  some  class  of  kernels  in  L^(R),  Hirschmann  found  the  critical 
rate  at  which  a  convolution  of  a  nontrivial  function  in  L°°(R)  with  this  kernel  may  decay  at 
oo  (i.e.,  faster  decay  is  impossible).  There  is  some  evidence  that  substantial  decay  implies 
corresponding  oscillatory  behavior;  this  seems  a  promising  and  mostly  unexplored  area. 
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11.  The  Problem 

Direct  Methods  for  crystal  structure  determination  are  based  on  mathematical  relations, 
so  called  phase  relations,  between  the  moduli  (observed  by  diffraction  experiments)  and  the 
(unknown)  phases  of  the  Fourier  coefficients  E(H)  of  the  periodic  electron  density  p(r).  The 
latter  can  be  approximated  by  one  electron  quantum  mechanical  wave  functions  ^(rj: 
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(11.1)  />(r)  =  I  'ip{rY\  ^  FT  ^  E{p)  =  "^{p)  * 

^  ^  ft  ^ 

A  question  arises  then,  whether  it  is  possible  to  obtain  phase  relations  from  fundamental 
Quantum  Mechanics.  We  show  below  that  the  Schrodinger  equation  in  Fourier  (momentum) 
space,  written  below  in  appropriate  atomic  units,  provides  a  basis  for  such  relations: 

Direct  space  Momentum  space 

(11.2)  -A^(r)/2  H-  V(r)V^(r)  =  eo^(r)  FT  p^^{p)/2  -\-W(p)*  "^(p)  =  £o^(p) 
with  V  ^  FT  =>  W 

For  a  periodic  crystalline  structure  the  convolution  integral  in  momentum  space  is  re¬ 
placed  by  a  discrete  summation  and  eq.  (2)  is  re-arranged  as  (3).  We  see  then  (with  an 
eigenvalue  £o  <  0  )  that  the  phase  of  the  wave  function  'if(p)  is  invariant  under  the  potential 
(convolution)  operator  W(p)*  : 

(1 1.3)  (i?V2  -  =  ~^kW{K)  -  K) 

The  next  key  remark  is  that  the  Fourier  Coefficients  W(K)  of  the  potential  function  V(r) 
are  identical  to  the  crystallographic  expression  -  Z  E(K)  /  K^.  For  a  unit  cell  containing 
N  identical  atoms  of  atomic  number  Z  we  have,  as  shown  in  Appendix: 

(11.4)  W(K)  =  -Zy'NE(K)/K^ 

Eq.  (4)  is  the  key  relation  linking  the  Quantum  Mechanical  potential  function  to  the 
diffraction  experiment  for  a  crystal.  Thus  eq.  (3)  is  written  as  (5),  as  shown  in  Appendix 

(11.5)  (ffV2  -  eo)^W  =  Zy'NZ,KE(K)^{H  -  K)/K^ 

(11.6)  E{H)  =  -  H) 

The  pair  of  eq.  (5)  and  (6)  forms  a  system  of  self  consistent  equations,  that  is  a  usual 
procedure  in  Quantum  Mechanics.  They  provide  the  physical  basis  of  the  TWIN  algorithm 
developed  for  crystal  structure  determination  (Hountas,  A.  and  Tsoucaris,  G.  (1995)  Acta 
Cryst.  A51,  754-763;  see  also:  J.  Navaza  and  G.  Tsoucaris,  in  Phys.  Rev.  (1981);  G, 
Berthier  et  al.  in  J.  Quantum  Chemistry,  1996,  p.  195-199).  The  final  form  (5)  of  the 
Schrodinger  equation  in  momentum  space  has  a  strong  similarity  with  one  of  the  fundamental 
phase  relations  of  Direct  Methods: 

(11.7)  Phase  of  E(H)  =  Phase  of  [EkE{K)E{H  ~  K)] 
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Note  that  the  positive  factor  IfK?  in  eq.  (5)  plays  the  role  of  a  weighting  factor  for  each 
contributor  E(K)  ^(H-K)  in  the  summation,  that  is  a  usual  technique  in  Direct  Methods. 

The  similarity  becomes  closer  with  a  further  approximation  related  to  the  classical  Linear 
Combination  of  Atomic  Orbitals  LCAO  method: 

^(H)  =  c  E(H)  with  a  known  constant  c>  0. 

In  conclusion,  we  have  shown  a  close  relation  between  a  fundamental  formula  in  Direct 
Methods  (obtained  by  Probability  theory  applied  to  the  FT  of  the  electron  density  of  a  crystal) 
and  the  Schrodinger  equation  in  momentum  (i.e.  FT)  space.  We  further  note  that  the  FT 
between  position  and  momentum  is  a  central  notion  in  Quantum  Mechanics.  It  corresponds 
to  a  kind  of  ’’built  in”  feature  of  Quantum  Mechanics  and  constitutes  the  fundamental  link 
between  the  position  and  the  momentum  of  a  quantum  mechanical  particle,  well  known  as 
the  Uncertainty  Principle. 

Note:  The  above  presentation  holds  for  a  simplified  model  of  electrons  moving  indepen¬ 
dently  to  each  other  (no  interelectronic  interaction)  in  the  potential  created  by  the  nuclei  of 
the  crystal.  Such  an  approximation  is  of  course  inappropriate  for  a  correct  quantum  descrip¬ 
tion  of  molecular  orbitals,  but  it  is  sufficient  for  the  determination  by  Direct  Methods  of 
approximate  phases  of  the  Fourier  coefficients  EH  (with  observed  moduli). 

12.  Appendix 

The  electron-nuclei  attractive  potential  for  N  atoms  of  atomic  number  Zj  at  positions  Tj 
is  (in  atomic  units): 

V(r)  =  -  Ej  Zj  /  |r  -  r^  |  with  r  G 

If  r  plays  the  role  of  time  and  K  E  that  of  frequency,  then  the  FT  at  K  of  V(r)  is : 

Wk  =  -  Ej  [  Zj  exp  ( 27r  i  K.  Tj  )  ]  /K^ 


reminding  that  in  3D  we  have: 

l/|r|  ^FT=^1/|K|2 

Note  that  for  simplicity  the  above  notation  of  modulus  for  the  3D  vectors  r  and  K  has 
been  omitted  throughout  the  text. 

The  Fourier  coefficients  E^-  (so  called  normalized  structure  factors)  involved  in  diffrac¬ 
tion  experiments  have  a  similar  expression  apart  for  the  (positive)  factor  1/K^  .  Indeed,  the 
FT  at  K  of  N  identical  atoms  -considered  as  Dirac  masses  at  positions  Tj  -  is  given  by: 

E/r  =  [  exp  (  27r  i  K.  rj  )  ]  /  ^/N 

For  atoms  of  atomic  number  Z,  eq.  (4)  follows. 
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Abstract.  This  introductory  paper  presents  a  method  for  the  analysis  of  differential 
equations  with  polynomial  coefficients  which  also  provides  a  further  insight  into  the 
Stokes  Phenomenon.  The  method  consists  of  a  chain  of  steps  based  on  the  concept 
of  the  Stokes  Structure  and  Fourier-like  transforms  adjusted  to  this  Stokes  Structure. 
Although  the  main  object  here  is  Bessel’s  equation  our  approach  can  be  extended  to 
more  general  matrix  equations.  It  will  be  shown  (i)  how  to  derive  the  Stokes  Struc¬ 
ture  directly  from  differential  equations  without  any  previous  knowledge  of  Bessel  or 
hypergeometric  functions,  (ii)  how  to  adjust  Fourier  transforms  to  the  Stokes  Struc¬ 
ture,  (iii)  how  to  answer  questions  on  the  interrelation  between  formal  and  actual 
solutions  of  Bessel’s  equation  using  Fourier  Analysis,  and  finally  (iv)  how  to  evaluate 
the  coefficients  of  the  Stokes  Structure,  thus  providing  a  new  insight  into  the  Stokes 
Phenomenon. 


1.  Introduction 

In  [4],  [5]  an  approach  for  the  study  of  a  general  class  of  matrix  differential  equations 
with  polynomial  coefficients  was  presented.  However,  this  study  does  not  cover  many  equa¬ 
tions  which  require  special  attention.  One  such  case  is  the  classical  Bessel’s  equation.  It  was 
explained  in  [3]  how  to  derive  properties  of  solutions  of  Bessel’s  equation  from  the  Fourier- 
dual  hypergeometric  equations.  In  particular,  it  was  shown  how  the  monodromic  properties 
of  hypergeometric  functions  are  transfered  to  solutions  of  Bessel’s  equation  as  algebraic  re¬ 
lations. 
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The  Hankel  functions  {z)  and  (z)  of  order  f  (or  Bessel  functions  of  the  third 
kind)  are  unique  solutions  of  Bessel’s  equation 

.2 


(1.1) 


satisfying  the  Hankel  inequalities  (or  expansions) 

(1.2, 


(1.3) 


(z) 


2\ 


1/2 


TTZ 


as  z  +00.  They  can  be  continued  analytically  as  single-valued  functions  to  the  whole 
Riemann  surface  of  logz  :  0  <  \z\  <  oo,  -oo  <  argj?  <  +oo. 

The  functions  Pi(z),  P2{z)  defined  by 


(1.4) 


H<-y(z)  = 


2 

TTZ 


1/2 


f{z-UTr/2-!t/A) 


(1-5) 


2\ 


1/2 


TTZ 


I 


-i-(z-i/7r/2-7r/4) 


are  known  as  the  phase  amplitudes  of  the  Hankel  functions  h\}\z),  H^u\z)  respectively.  It 
follows  that 

(1.6)  Pi(^)  =  l  +  o(l),P2(>^)  =  1  +  0(1) 

as  2:  ->•  +00.  They  can  also  be  extended  as  analytic  single-valued  functions  to  the  whole 
Riemann  surface  of  log  z.  Moreover,  they  satisfy  respectively  the  following  pair  of  differential 
equations 

(1.7)  CiPi{z)=0,C2P2{z)  =  0 
with  the  pair  of  differential  operators  £i,  C2  defined  by 

(1.8)  Cl  =  z^DI  +  2iz^Dz  -  b 
and 

(1.9)  Cq  =  z^Dl  -  2iz^Dr  —  h 

where  6  =  -  1/4  and  ^ 
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On  the  other  hand  there  exists  a  unique  pair  of  factorially  divergent  power  series 

oo  oo 

(1.10)  =  l  +  =  l  + 


m=l 


m=l 


formally  satisfying  equations  (1.7)  respectively. 

It  is  natural  to  introduce  the  Fourier-dual  operators  C{y  to  £i,  £2 


(1.11) 

£I  =  ^(e-2j)D|  +  2(e-i)£>5-6 

(1.12) 

where  77^  =  ^- 

There  exists  a  unique  pair  Pi  (0  ?  P2  (0  of  solutions  of  £*Pi  (^)  =  0,  C2F2  (f)  =  0 
respectively,  analytic  at  the  singular  point  ^  =  0.  This  pair  is  nothing  but  the  pair  of  Gauss 
hypergeometric  functions 

(1.13) 

(1.14) 

It  is  not  difficult  to  check  that  the  formal  power  series  Pi[z)^  A (>2^)  can  be  represented 
respectively  as  formal  Laplace  transforms  of  the  formal  hypergeometric  series 

(1.15) 

-  l')m 

^  (2j)’"(l)™m! 

(1.16) 

i\m(2 +^)’n(2  “ 

>  (2i)"‘(l)„m!  ^ 

where 

(1.17)  (a). 

n  =  a{a  +  l)...(a  +  m  1)  =  ^ ^ , 

while  the  phase  amplitudes  Pi  {z) ,  P2  {z)  can  be  represented  as  classical  Laplace  transforms 
of  (1.13),  (1.14)  respectively,  see  [3].  In  other  words,  these  formal  series  and  the  phase  am¬ 
plitudes  are  generated  in  the  same  manner  by  different  branches  of  the  same  hypergeometric 
function. 
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Moreover,  using  (1.4),  (1.5)  we  obtain  the  following  integral  representations  of  Hankel 
functions 

(US)  «.)(.)=  1,1)^ 

(1.19)  HP(,)=  (l_„,l  +  ..l,4),i5, 

which,  upon  using  the  monodromic  properties  of  hypergeometric  functions,  yield  the  follow¬ 
ing  monodromic  relation,  see  [3] 


(1.20) 


{I  0\(P,  (^e^-O  \  _  M  Tie'2-  \fp,  (z)  \ 

[  -r2e2-  1  ;  P2{ze^^^)  ;  V  0  1  )\  P2{z)  ) 


where  Ti ,  T2  are  complex  constants. 

This  relation  suggests  an  algebraic  structure  for  the  phase  amplitudes  P\{z) ,  P2  (^)  on 
the  Riemann  surface  of  logz,  which  will  form  the  basis  of  our  present  investigation.  The 
principal  idea  of  this  paper  is  to  apply  Fourier  transforms  to  this  algebraic  structure  rather  than 
to  the  original  differential  equation.  It  should  be  noted  in  fact  that  our  approach  presented  in 
Sections  2,  6,  7,  8,  9,  10  to  follow  does  not  depend  on  the  original  differential  equation. 


2.  The  Stokes  Structure  6 

Definition  A  pair  of  functions  Pi  (.2:) ,  P2(z) 

(i)  analytic  on  the  Riemann  surface  of  logz  with  at  most  exponential  growth  at  z  ~  00  in 
every  sector  S a, p  =  {z  :  -00  <a<argz<p<  -l-oo} 

(ii)  satisfying  inequalities 

(2.1)  Pi{z)  =  14-  0(1) ,  z  cx),  z  G  Sc(l) 

(2.2)  P2(z)  -  1  +  0(1) ,  z  ^  00,  z  e  Sc{2) 
in  the  closed  subsectors 

(2.3)  5'c(l)  C  S(l)  =  {z  :  -TT  <  argz  <  27r,  0  <  l^l  <  00} 

(2.4)  5c(2)  C  S(2)  {z  :  -27r  <  argz  <  tt,  0  <  |2:|  <  00} 

(iii)  satisfying  the  monodromic  relation  (1.20)  with  complex  constants  Ti,  T2  written  as 

(2.5)  Pi{ze^^^)  -  Pi{z)  +  TiP2{z)e~^^^ 

(2.6)  P2{ze‘^")  =  P2{z)  +  T2Pi{ze^ne^^^ 
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are  the  elements  of  the  Stokes  Structure 
(2.7)  e  =  {P,(z),P2(z)l 

3.  From  Differential  Equation  to  6 

This  technique  does  not  require  any  previous  knowledge  or  properties  of  the  solutions  of 

(1.1)  nor  of  the  hypergeometric  functions. 

Definition  The  rays 

(3.1)  l=^{z:  Re(iz)  =  0} 

are  called  separationrays  for  (LI). 

Let  us  look  for  solutions  ?/i,  2/2  of  (1-1) 

(3.2)  2/l(z)  =  j 


"  1  ^  Piiz) 


such  that 

(3.4)  Piiz)  =  1  +  o(l),  P2iz)  =  1  +  0(1) 

as  2:  00  along  a  separation  ray  I  on  the  Riemann  surface  of  log  z. 

In  terms  of  Pi(z) ,  P2{z)  the  differential  equations  (1.7)  together  with  conditions  (3.4)  can 
be  equivalently  rewritten  respectively  as 


^  ^  2t  .L  up  2^  /n  (w  +  zP 


P2{z)  —  1  + 


P2{W) 


,  (w  +  zy 


with  00;  =  00  • 

The  integral  equations  (3.5),  (3.6)  can  be  analyzed  using  successive  iterations  to  con¬ 
struct  the  unique  solutions  Pi{z)  ,P2{z)  satisfying  inequalities  (3.4)  respectively,  see,  for 
example  [2].  Further  analysis  of  these  integral  equations  for  a  specially  chosen  I  shows  that 
7^1  (z) » P2  {z)  fonn  in  fact  the  Stokes  Structure  &  defined  above  by  (2.7). 
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4.  Formal  and  Actual  Solutions 

Choosing  the  separation  ray  arg  2:  =  0  as  the  paths  of  integration  in  (3.5),  (3.6)  respec¬ 
tively  to  construct  the  solutions  Pi{z),  P2{z)  and  using  the  uniqueness  of  this  pair  and  that 
of  ahoy iM 

(4.1)  =  yii^)  =  Pj(2) 

(4.2)  H^?{z)  s  3/2(2)  =  j  e-4z-W2-./4) 

which  are  identical  to  (1.4),  (1.5).  Thus  the  solutions  of  (3.5),  (3.6)  for  this  chosen  separa¬ 
tion  ray  are  nothing  but  the  phase  amplitudes  Pi{z),  P2iz)  of  the  Hankel  functions  Hl}\z)y 
H^u\z)  respectively. 

Another  pair  of  linearly  independent  solutions  of  (1.1)  is 

(4.3)  m{z)  =  j  A  (2) 


(4.4) 


^2(2) 


where 

(4.5) 


A(2)  =  E 

m=0 


(2  ~b  ^)m(2  ^)m 

(2i)’"(l)m 


(4.6) 


A{2)  =  ^(-1) 

m=0 


(2  ^)m(2  ^)m 


Formal  substitution  of  these  solutions  into  (1.1)  yield,  after  canceling  the  exponentials, 
power  series  in  z~^  with  zero  coefficients.  However,  the  above  power  series  are  clearly 
factorially  divergent  for  any  z  if  1/ is  not  a  half  integer.  Thus,  these  solutions  can  be  regarded 
as  foniial  solutions  as  opposed  to  actual  solutions. 

Three  natural  questions  arise  immediately: 

(1)  how  to  relate  the  pair  of  formal  solutions  one  to  another, 

(2)  how  to  relate  the  pair  of  formal  solutions  to  actual  solutions  h\}\z),  i^l?^(^), 

(3)  how  to  decode  properly  the  symbol  o  (1)  in  the  expansions  above. 

Stokes  (1857)  was  the  first  one  to  formulate  and  answer  the  first  two  questions  for  Airy’s 
differential  equation  y"  —  zy  =  0  related  to  Bessel’s  equation  for  z/  =  ”.  To  answer  the  third 
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question,  Poincare  (1886)  considered  formal  solutions  as  asymptotic  representations  of  actual 
solutions.  However,  as  discovered  a  century  later,  see  [1],  this  approach  is  not  satisfactory 
since  it  does  not  answer  question  (1)  altogether,  only  answers  partially  question  (2),  and  does 
not  provide  sufficient  information  about  the  remainder. 

5.  The  Stokes  Phenomenon 

Using  (4.1),  (4.2)  the  monodromic  relations  (2.5),  (2.6)  can  be  rewritten  in  terms  of 
(z),  hP  (z)  as 

(5.1)  (ze^")  =  (z)  +  (z) 

(5.2)  (z)  +  (ze^’'*) . 

These,  in  turn,  yield  extended  Hankel  expansions  valid  outside  the  sectors  in  (2.3),  (2.4). 

All  these  Hankel  expansions  are  of  the  form 

(5.3)  z-‘/^(y4(j/)e"  +  B(i/)e-“). 

Again,  Stokes  (1857)  was  the  first  to  discover  that  the  constants  A  (j/)  and  B  (u)  are 
discontinuous  as  arg  z  changes  continuously  when  crossing  the  separation  rays.  The  existence 
of  such  discontinuities  is  called  the  Stokes  Phenomenon  and  the  corresponding  values  of 
the  jumps  in  A{i/)  ^  B  (u)  can  be  expressed  in  terms  of  connection  coefficients  Ti,  T2  very 
important  in  many  applications.  A  modem  insight  into  the  Stokes  Phenomenon  can  be  found 
in[l]. 

A  fourth  question  then  arises: 

(4)  how  to  evaluate  the  connection  coefficients  Ti,  T2. 

Starting  with  the  Stokes  Structure  we  will  present  a  technique  that  answers  questions 
(l)-(3).  The  culmination  of  our  approach  will  be  to  answer  question  (4),  obtaining  explicit 
expressions  for  the  connection  coefficients  independently  of  any  knowledge  of  the  actual 
solutions  of  the  differential  equation. 

6.  Fourier-Like  Transforms  Adjusted  to  6 

Let  Pi  (z) ,  P2  [z)  be  elements  of  6  with  (unknown)  Ti,  T2  in  its  monodromic  relations 
(2.5),  (2.6)  and  5c  (1)  C  5  (1) ,  5c  (2)  C  5  (2)  a  pair  of  closed  subsectors  with  angles  greater 
than  TT. 

Let 

(6.1)  H  {z)  —  a{^z  {I  ~\- o  {\)) ,  z 00 
be  analytic  on  the  Riemann  surface  of  log  z,  and 

(6.2)  C(l/z)  —  Co +  ciz  + , . . 
an  entire  function  with  complex  ao  ^  0;  Cq,  Ci, . . .. 
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We  define  the  general  Fourier-like  transforms  of  Pj(z),  j  =  1,2  as 

(6.3)  Fj  iO  =  ^  {z)  P,  (z)  dz/z,  j  =  1, 2 

with  paths  of  integration  7  (1) ,  7  (2)  as  boundaries  of  5c  (1),  Sc  (2)  respectively,  oriented  so 
that  Sc  O’)  are  to  the  right  of  7  0)- 

7.  Main  Result 

Theorem  1 .  Let  Pi  {z) ,  P2  {z)  be  the  elements  of  the  Stokes  Structure  ©. 

Then  for  each  j  =  1, 2  in  the  dual  complex  (—plane 

(i)  there  exists  a  ray  Ij  emanating  from  the  origin  such  that  Fj  (^)  is  continuous  for(  e  Ij, 
and  Fj  (^)  can  be  continued  analytically  to  some  open  sector  containing  the  ray  Ij; 

(ii)  there  exists  a  neighborhood  of  the  origin  such  that  Fj  (f)  can  be  further  continued 
analytically  to  this  neighborhood  as  a  single-valued  function; 

(iii)  moreover,  Fj  {()  can  be  continued  analytically  to  the  whole  (-plane  along  every  path 
not  crossing  the  point 

(7.1)  eo  =  ^  (-1)"''  • 

do 

8.  From  6  to  Formal  Power  Series 

Consider  the  special  cases  of  Fourier-like  transforms  (6.3)  for  H  (z)  —  z ,  C  (z)  =  1. 
These  are  nothing  but  the  Borel  transforms  of  Pj(z) 

(8.1)  F}'’^  (0  =  ^  f  {z)  dz/z,  j  =  1, 2. 

2m 

Their  inversion  formulae  are  nothing  but  the  Laplace  transforms  of  pj”^  (^) 

(8.2)  Pj  {z)  =  z  f  (?)  d?,  j  =  1, 2. 

Jlj 

Due  to  (i),  (ii)  of  Theorem  1  the  integrals  (8.1)  are  absolutely  convergent  for  (  €  Ij  and 
(0  can  be  represented  by  their  Taylor  series,  which  can  be  regarded  as  formal  power 
series  in  ( 

00 

(8.3) 

k=0 

Substituting  (0  for  Fj°^  {()  in  (8.2)  and  writing 

dcf  7  I 

Hk  =  F.PjJ 


(8.4) 
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yield 

r  oo 

(8.5)  PP  (z)  =  W  (0  =  E  %  j  =  1’  2- 

k=o  ^ 

The  symbol  fps  means  that  (8.5)  should  be  perceived  on  the  level  of  formal  power  series. 

9.  Formal  Series  as  Strong  Expansions 

Although  for  an  element  Pj  of  the  Stokes  Structure  (2.7) 

(9.1)  lim  Pj(2)  =  1, 

z~^ooz£Sc{j) 

it  is  not  at  all  obvious  that  the  Stokes  Structure  guarantees  the  next  limits 

(9.2) 

However,  the  formal  series 

(9.3) 


lira 

z->ooz€5c(j) 


EO‘l,k 

zk  '  2-^  zk 
fc=0  ^  fc=0  ^ 

are  Poincar6  asymptotic  expansions  for  Pi  {z) ,  P2  {z)  in  sectors  5  (1) ,  5  (2)  respectively. 
This  means  that  for  any  subsector  Sc  (j)  of  S  {j)  and  for  z  G:  Sc  {j)  there  exists  Mj^  >  0  such 
that  the  following  estimates  are  valid  for  A  =  1, 2, . . . 

^-1  I 


(9.4) 


fc=0 


\N- 


It  should  be  noted,  however,  that  these  approximations  are  too  rough  to  provide  real 
information  about  the  behavior  of  the  remainders 

N~1 

^k 
Z^ 


(9.5) 


A:=0 


since  we  don’t  know  how  M  depends  on  N. 

In  fact  the  formal  series  (9.3)  are  much  better  and  more  precise  asymptotic  expansions 
for  Pj  {z)  than  the  Poincare  expansions. 

Theorem  2.  For  any  subsector  Sc  (j)  of  S  (j)  and  for  z  £  Sc  (j)  there  exists  a  >  0 
depending  only  on  Sc  {j)  such  that  the  following  estimates  are  valid  for  TV  =  1, 2, . . . 


(9.6) 


«:=0 


Ma'^m 


N' 


These  expansions  are  known  as  strong  asymptotic  expansions,  see  [7],  [6].  In  contrast  to 
Poincar6  expansions  they  have  the  following  uniqueness  property: 
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Watson ’sTheorem.  Watson’s  Theorem  If  Pi  (z) ,  P2  {z)  are  analytic  functions  in 
a  sector  S  with  its  angle  not  less  than  tt,  and  strong  asymptotic  expansion  in 

5,  then  Pi  {z)  =  P2  [z) . 

The  inequalities  (9.6)  answer  our  question  (3). 


10.  Power  Series  Representation  of  Fj  (f ) 

Now  that  we  have  ^  natural  to  formally  substitute  these  for  Pj  (z)  into  the 
general  Fourier-like  transforms  (6.3)  to  yield  the  formal  Fourier-like  transforms 

(10.1)  Pi  iO  =  ^  f  (2)  Pf  (2)  dzjz 

resulting  in  the  power  series  in  f 

00  00  /  k 

(10.2)  Fj  (^)  =  ^^^k  I  "^2/ 

k=0  k-0  \m=0 


with 

(10.3) 

and 

(10.4) 

Theorem  3 .  The  power  series  Fj  (^ )  are  absolutely  convergent  and  thus  represent  the 
analytic  functions 

(10.5)  Fj  (?)  =  ^  C"  (  Sk  I  y] 

k-0  \  \m=0 

inside  the  circle  of  radius  with  its  center  at  ^  =  0.  Moreover,  if  ^  €  Ij  and  |^|  ^  \l\  then 

(10.6)  Pj(asFj(0. 


Thus,  the  Fourier-like  transforms  Fj  ($)  can  be  represented  both  by  the  integral  transforms 
(6.3)  and  by  the  convergent  Taylor  series  (10.5)  for  f  €  Ij,  |  <  |5oh  where  Ij  and  ^0  are 
defined  in  Theorem  1  (i)  and  (iii),  (7.1)  respectively. 
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11.  Evaluation  of  Borel  IVansfonns 


Now  let  us  return  to  Bessel’s  equation  (1.1)  and  remember  that  the  elements  Pi  {z) ,  P2  {z) 
of  the  Stokes  Structure  (2.7)  are  the  phase  amplitudes  of  the  Hankel  functions  {z)  , 
(^). 

It  follows  from  Theorem  2  that  in  particular  the  formal  series  (9.3)  are  Poincard  asymp¬ 
totic  expansions  of  Pi  {z) ,  P2  [z).  On  the  other  hand,  one  can  derive  from  integral  equations 
(3.5),  (3.6)  that  the  formal  power  series  (4.5),  (4.6)  are  also  Poincare  asymptotic  expansions 
of  Pi  {z) ,  P2  {z) .  It  should  be  noted,  however,  that  it  is  a  hard  problem  to  derive  from  in¬ 
tegral  equations  (3.5),  (3.6)  that  the  formal  power  series  (4.5),  (4.6)  are  strong  asymptotic 
expansions  for  Pi  {z) ,  P2  {z) . 

The  uniqueness  property  of  Poincar^  asymptotic  expansions  yields 


(11.1) 


{k  +  '^Uk-^)k 


(11.2) 

that  is 


0-2,k  =  (  — 1) 


(11.3)  Pj{z)  =  P/\z) 

and  the  left-,  right-hand  sides  of  (1 1.3)  are  defined  by  (8.5)  and  by  (4.5),  (4.6)  respectively. 

It  is  worth  noting  that  (11.3)  is  in  fact  the  converse  of  an  important  principle  that  was 
named  in  [3]  as  the  Principle  of  Functional  Closure:  If  a  formal  series  satisfying  a  differen¬ 
tial-difference-algebraic  relation  can  be  summed  to  an  analytic  function  in  a  region  of  the 
complex  plane,  then  this  function  satisfies  exactly  the  same  relation  in  this  region. 


It  follows  from  (8.4)  and  (8.3)  that 
(11.4)  + 

where  P  1/,  |  —  z/,  1,  ±^)  are  power  series  expansions  in  f  of  Gauss’  hypergeometric 
function 

F  +  ±1) ,  respectively. 


12,  Interrelation  between  Solutions 

It  follows  from  (11.4)  and  (8.1)  that  the  Borel  transforms  of  the  phase  amplitudes  of  the 
Hankel  functions  are  in  fact  the  hypergeometric  functions,  while  the  formal  Borel  transforms 
of  the  formal  power  series  are  the  corresponding  (formal)  hypergeometric  series. 
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The  following  relations  are  valid 


(12.1) 

1 

27rz  j 

f  e^^Pi  {z)  dzjz 
^(1) 

(12.2) 

].± 

1  2ni 

f  e^^P2  (z)  dzjz 

(12.3) 

1 

l<M 

f  e^^Pi{z)dz/z 
hii) 

(12.4) 

\  ^  ^ 

'  27ri 

f  e^^p2  {z)  dz/z. 

A(2) 

The  representations  (12.1)-(12.4)  together  with  their  respective  inversion  formulae 


(12.5)  =  + 

(12.6)  P2(z)  =  zj^  (5  ■“ 

(12.7)  A  (A  Z  e-^^F  (5  +  ^,  J  1, 1)  d? 

(12.8)  A  (z)  =  z  I"  e-^(F  + 

reveal  the  following  one-to-one  correspondences  (denoted  by  the  symbol  t->)  below 


(12.9) 


A  [z)  o  A  (0  =  f  (i + I  - 1/,  1,  ±i)  ^ 

o  F  (i  -I- 1/,  I  -  i',  1,  ±^)  o  Pj  (z) . 

These  interrelations  show  that  both  formal  series  A  (z) ,  A  (z)  and  actual  functions  Pi  {z) , 
P2  {z) ,  are  generated  in  the  same  manner  by  different  branches  of  the  same  hypergeometric 
function,  thus  answering  questions  (1)  and  (2). 


Remark 

Formulae  (12.5),  (12.6)  together  with  (4.1),  (4.2)  yield  again  the  integral  representations 
(1.18),  (1.19)  for  Hankel  Functions.  It  is  curious  that  we  could  not  hnd  these  representa¬ 
tions,  the  most  principal  in  our  context,  in  the  classical  literature  on  Bessel  functions.  In  the 
literature,  the  Hankel  expansions  are  commonly  derived  from  the  representations 


(z)  = 


r  (!->')  nr 


TT^f^i 


[  {f-iy 

J'Yi 


dt 


(12.10) 
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(12.11) 


^dt 


with'yu  72  simple  loops  bypassing  t  =  ±1  but  not  enclosing  t  =  =f1.  respectively,  |arg2|  < 
|,  and 

These  are  derived  by  reducing  BesseVs  equation  to 
(12.12)  zw" (2j/ +  1)  w' +  zw  =  0 


for  the  variable  w  =  and  then  applying  the  Laplace  transform  to  this  special  equa¬ 
tion  with  linear  coefficients.  Unfortunately  this  approach  is  generally  not  possible  for  other 
differential  equations. 


13.  The  Connection  Coefficients 

Consider  the  Fourier-like  transforms  ,  F2  (^)  of  Pi  (z) ,  P2  [z)  defined  by  (6.3) 
with 

(13.1)  H{z)  =  2iz,C(z)  =  -—,b  =  v^-\. 

Z  4 

(13.2)  Fi  (^)  =  -^  /  (z)  dziz,  j  =  1, 2. 

Theorem  4.  Let  Pi  (z) ,  P2  (z)  be  the  phase  amplitudes  of  (z),  (z)  with 

Fourier-like  transforms  Pi  (C) ,  P2  ($)  defined  by  (13.2).  Then 

(i)  the  only  finite  singular  point  of  both  analytic  functions  Pi  (f)  ,  — P2  (— f)  is  ^  =  1 

(ii)  the  limiting  values  of  Pi  (^) ,  -P2  (-^)  at  f  —  1  exist  and  are  equal  to  connection 
coefficients 


(13.3)  limPi  (0  =  Ti,  Urn  (-P2  (-f))  =  T2 

(iii) 

(13.4)  T,  =  (-If  ^  (2)  dz/z,  j  =  l,2 

2i  2 

where  7*  (j)  are  obtained  by  rotating  7  (j)  into  positions  where  functions 
are  decreasing  for  z  6  7*  (j) ,  j  =  1, 2  respectively. 

(iv)  Moreover,  let  fj^k  be  coefficients  of  power  series  in  (10.2)  for  H  (z)  and  C  (z)  given 
by  (13.1).  Then 
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(13.6)  = 

°  \k=0 


It  follows  from  Theorems  1  and  2  that  for  1^  |  <  1 


(13.7) 

FiiO  =  + 

“  im  4- 1)!  m\ 

m=0  ^  ' 

F2{0  = 

+  rn\  ^ 

m=0  '  ' 

hence 

(13.8) 

F,  iO  =  -F2  (-0  =  -2^ibF  Q  -  1  +  I/,  2,  . 

Substituting  ^ 

=  1  yields 

(13.9) 

Tj  =  -2iTbiF  +  2.  0 . 7  =  1>2 

which,  using  Gauss’  formula,  reduces  to 

(13.10) 

T  —  7  —  12 

r(i  +  i  +  .)r(i  +  i-.)’' 

and  finally 

(13.11) 

Tj  —  2icos7rzv,  j  =  1,2. 

It  is  worth  noting  that  generally  it  is  impossible  to  express  Tj  in  terms  of  known  fuctions. 
Their  integral  representation  should  be  used  to  evaluate  them  asymptotically  for  extremal 
values  of  parameters  of  the  differential  equation. 

Their  Taylor  series  representation  should  be  used  for  their  numerical  evaluation. 

14.  Conclusions 

We  have  shown  that  the  Stokes  Structure  6  is  of  fundamental  importance.  Starting  with 
Bessel’s  equation  (1.1)  we  derived  6  and  introduced  and  studied  Fourier-like  transforms 
adjusted  to  0.  These  yielded  formal  power  series  that  are  in  fact  formal  solutions  of  (1.7). 
Furthermore,  as  shown  by  (12.9)  the  phase  amplitudes  and  their  respective  formal  series  are 
generated  in  the  same  manner  by  different  branches  of  the  same  hypergeometric  function. 
These  provide  the  basis  for  a  systematic  chain  of  steps  to  answer  questions  (1),  (2),  (3),  (4) 
above,  and  an  approach  which  can  be  extended  to  matrix  equations  with  many  applications. 


V.  Gurarii,  et  al  /How  to  Use  the  Fourier  Transform  in  Asymptotic  Analysis 


401 


References 

[1]  Braaksma  G.  Imniink  and  M.  van  der  Put,  (eds)  The  Stokes  Phenomenon  and  Hilbert's  16th  Prob¬ 
lem,  Singapore:  World  Scientific,  1996, 326  pp. 

[2]  CoddingtonE.  A.  and  N.  Levinson.  Theory  of  ordinary  differential  equations.  New  York:  McGraw  Hill, 
1955, 441p. 

[3]  Gurarii  V.  and  V.  Katsnelson.  The  Stokes  Structure  for  the  Bessel  Equation  and  the  Monodromy  of  the 
Hypergeometric  Equation.  Preprint  3/2000,  NTZ,  Universitat  Leipzig,  Preprint  is  available  from  the  WEB 
site  http://  www.uni-leipzig.der ntz/prentz.htm 

[4]  Gurarii  V.  and  V.  Matsaev.  The  generalized  Borel  transform  and  Stokes  multipliers.  In:  Theoretical  and 
Mathematical  Physics,  Vol.lOO ,  No  2,  173-182  pp.,  Moscow,  1994. 

[5]  Gurarii  V.  and  V.  Matsaev.  The  Generalized  Borel  Transform  in  Asymptotic  Analysis.  In:  The  Role 
of  Mathematics  in  Modem  Engineering,  (eds.  A.K.  Easton  and  J.M.  Steiner)  585-597  pp.  Sweden, 
Lund:Chartwell-Bratt,  1996. 

[6]  Reed  M.  and  B.  Simon.  Methods  of  Modem  Mathematical  Physics,  IV,  Analysis  of  Operators  Academic 
Press,  New  York  San  Francisco  London,  1978, 428p. 

[7]  Watson  W.  A  Theory  of  Asymptotic  Series.  In:  Phil.  Trans.  Roy.  Soc.,  Ser.  A  211,  279-313.  London, 
1911. 


Index 


B-functions,  184 
Gs  set,  58 
Gs  set,  141 
M  sets,  67 
Ma  sets,  67 
X-dominated,  243 
X-spectral  hull,  249 
e-compression,  13 
a-pair,  8 
a-pair,  strong,  9 
n-resolvent,  242 
n-visible,  237 
p-oscillation,  133, 134 
2:-transform,  381 
hard  summation,  103 

A/D,  298 

Abiguity  Problem,  312,  314 
abiguity  problem,  312-314 
absorbtion  condition,  259 
admissibility  condition,  334-338, 
Admissible,  365 
admissible,  337,  338,  340 
Affine  Dual,  364 
affine  dual,  341 
Affine  Group,  335 
affine  group,  174, 333, 335,  338 
Affine  System,  366 
affine  system,  339-341 
Airy’s  differential  equation,  392 
Aleksandrov,  A.B.,  14 
almost  periodicity,  274, 275 
{a,  j3}function,  224 
Amplifiers,  296 
amplifiers,  296 
amplitude  modulation,  298 
analytic  bootstrapping,  213 
Annihilating,  8 
annihilating,  8 
antenna,  296, 299,  300, 316 
antilocality,  24-27 
approximation  theory,  4, 203 
arithmetic,  7 
atomic  positions,  164 


atomicity,  163, 164 
attentuation,  299, 325 
Auscher,  360 
Auslander,  303, 305,  328 

Baire,  57-59, 61, 65, 70, 71 

Baire’s  space,  141 

Balian-Low  Theorem,  179, 180 

Balian-Low  theorem,  76, 93, 101 

band  limited,  5, 17, 20, 23 

bandwidth,  296, 299,  300,  309, 310, 325, 327 

Bari,  15, 28 

Barker  codes,  320, 322 

Barker  on  Barker,  321,  323 

Barker  sequences,  226, 320,  321 

Bastiaans,  74, 75, 94,  99 

beams,  296 

Beck,  284, 292 

Belov,  15 

Benedicks,  13, 14,  23 

363  Bernstein  norm  estimate,  14 

Bernstein’s  inequality,  202 
Bernstein,  S.N.,  202 
Besov,  362 

Besov  spaces,  127, 128, 130-132, 135, 144 
Bessel  family,  340 

Bessel  functions,  145, 146, 148, 149, 388, 398 

Bessel’s  equation,  387, 392, 396,  398, 400 

Bessel’s  inequality,  285 

best  polynomial  approximations,  247 

Beurling,  18,  21, 23, 28, 272, 273, 292 

Beurling  algebras,  247 

Beurling,  A.,  40 

Beurling-Malliavin  majorant,  20 

Beurling-Sobolev  algebras,  235, 240 

Bezout  equations,  235 

bi-unimodular,  220 

bi-unimodular  polynomial,  228 

binary  function,  223 

bistatic,  295 

Bjorck,  219 

Bjork,  J.-E.,  241 

block  matrices,  47 

block  shift,  38-40, 42, 45 


403 


404  Index 


BM-majorant,  20 
Bochner’s  theorem,  31,  37 
Bohr  compactification,  257 
Borden’s,  298 
Borel,  62, 63 
Borel  measure,  14 
Borel  measures,  253 
Borel  transforms,  394,  397 
Borgain,  66,  67, 70 
Borichev  Theorem,  22 
Borwein,  R,  217 
bounded  action,  239 
bounded  character,  254 
bounded  support,  5 
Bourbaki,  266 
Bourgain,  27,  28,  288,  292 
Bownik,  M.,  342 
Brennan,  J.,  22 

Brownian  motion,  61, 62,  68, 69 
Bueckner,  312 
butterfly,  151, 152 

Calderdn,  334 

Calderdn  reproducing  formula,  335 
Calderon,  A.,  241 

canonical  dual  system,  76,  78,  80,  82,  99 

Cantor,  58,  68 

Cantor  measure,  15 

Cantor  subsets,  15 

Carleman  formula,  18 

carrier  frequency,  298,  300 

carrier  signal,  296 

Cartwright,  21 

Cartwright  class,  25 

category  theorem,  57,  58,  71 

Cauchy  potential,  17, 22 

causal,  6 

causality  principle,  6 
CC-condition,  87 
Chebishev,  203 
Chen,  283, 292 
Cherednik,  145 

Chinese  remainder  theorem,  152 
Chirp,  319,  321 
chirp,  319,  320, 323 


Clairaut,  220 
Classical  PONS,  326 
clutter,  296,  298 
Cohen,  196, 199 
Cohen,  R,  241 
Cohen,  Paul,  23 1 
Coherent  States,  364,  365 
coherent  states,  338 
coin  tossing,  107 
combinatorial  designs,  201 
combined  spaces,  173,  197 
commutant  lifting  theorem,  31,  33, 52 
complementary  pairs,  325 
completely  (5-visible,  238 
completely  antilocal,  26 
completely  local,  24,  25 
completely  visible,  237 
Complex  Analysis,  14-17 
complex  analysis,  3-6,  21 
complexity,  381 

condensation  of  singularities,  109, 116 
Connection  Coefficients,  399 
connection  coefficients,  393,  399 
connectivity,  357,  360 
constrained  global  optimization,  163 
converters,  298 

convolution  multiplier  norm,  246 
Cooley,  James,  152 
Cooley-Tukey,  151,  152 
corona  problem,  237 
correlation,  301,  317,  320-323 
Costas  arrays,  322,  323,  327 
crest  factor,  218 
cross  ambiguity  function,  306 
crystal  structure,  163-165, 384 
Curtis,  190,  198 
cyclic  difference  sets,  201 
cyclic  groups,  260 
cyclotomy,  201 

Dai,  Xingde,  357 
Daubechies,  175,  198 
dB  plot,  319 

De  Branges,  23,  25,  26,  28 
De  Branges  Theorem,  25 


Index 


405 


De  Buda,  313 

de  Jeu,  145, 148, 149 

De  Leeuw-Katznelson  Theorem,  15 

decibels,  319 

deep  zero,  4, 15, 18, 19 

demodulated,  300 

Denjoy,  57,  58, 70 

DFT,  220 

dichotomy  problem,  66 
difference  sets,  222 
diffraction  experiment,  385 
diffraction  experiments,  384 
diffraction  maxima,  163 
diffraction  pattern,  164 
dilation,  44 

dilation  matrix,  339, 362 

dilation  operator,  329,  346 

dilation  subgroup,  335 

dilation  theory,  33, 43, 47,  52-54 

dimension  function,  356 

Direct  Methods,  171,  384 

direct  methods,  164 

disc  algebra,  381 

discrete  Fourier  transform,  220 

discrete  logarithmic  integral,  5 

distance  controlled  resolvent  growth,  242 

Distribution,  28 

distribution,  3,  4,  6,  16,  19, 22-24 
Divergence,  125 
divergence,  119, 125 
Diximier,  377 
Doppler,  298 

doppler,  295,  296,  298,  300-302,  305,  309,  310, 
315-320,325,327 
dual  frame,  73-75, 78,  83,  84, 179 
dual  group,  260 
Dunkl,  145, 149 
duplexer,  296 
Dyn’kin,  22 
Dyn’kin,  E.,  241 
Dyson,  Freeman,  289 

EFET,  182-184, 195 

efficiency  hull,  235 

efficient  resolvent  complement,  249 


Ehrling,  G.,  241 

El-Fallah,  O.,  241 

electronic  counter-measures,  296 

Enflo,  Per,  220 

entropy,  24,  26 

Erdos-Newman  conjectures,  231 
Erdos,  272 
Erdelyi,T.,217 
essential  spectrum,  8 
essential  support,  8, 10 
Exponential  Sum,  276, 292 
exponential  sum,  271, 276, 278, 281 
extremal,  311,  312 
extremal  problems,  201 

F,  and  M,  Riesz  Theorem,  14 
Fabry  Gap  Theorem,  282 
fast  bilateral  decay,  4 
Fast  Fourier  transform,  151 
Fatou,  57, 71 
Fejer,  204 
Fejer,  247 
Fekete,  204 

Fekete  polynomials,  230 
Filters,  296 
filters,  297, 301 
finite  energy  signals,  305 
first  category,  59 
flatness,  218 

Foias-Sz.-Nagy  result,  52 
Fourier,  220 

Fourier  transform,  4,  9, 29 
Fourier-like  transform,  387,  393,  394,  396,  399, 
400 

Fourier-Stieltjes  transform,  173, 183, 190 
FPGA,  151 

fractal  dimensions,  127, 128, 132 
fractional  brownian  motions,  136 
frame,  78 
frame  bounds,  178 

frame  lower  bound,  78,  81,  85,  89,  92, 93, 96 
frame  operator,  75, 76, 78-87,  89-92, 96, 97, 99 
frame  upper  bound,  77,  78,  80,  82-85,  87-94,  96, 
97 

Frappier,  C.,  218 


406 


Index 


Frisch-Parisi  formula,  136 
full  spectral  hull,  249 
function  spaces,  127-129 
functional  calculus,  33, 38, 46 
Fundamental  maximal  property,  170 

Gabor,  74-76, 84, 98-101, 173-177, 179-181, 196- 
199 

Gabor  frame,  73, 75,  76,  79,  84-86,  88,  89,  92-95, 
97-99 

Gabor  frame  operator,  73,  76,  100 
Gabor  frame  operators,  73 
Gabor  frames,  173 

Gabor  system,  73-77, 79,  83,  85,  87-93, 98, 99 

Gabor  systems,  363 

Gabor  theory,  73,  75-77,  83 

Gallagher,  286, 290, 292 

Galois  field,  323 

Garrig6s,  G.,  360 

Gauss,  151,155,220 

Gaussian  process,  299 

Gaussian  sequence,  220 

Gaussian  series,  62 

Gelfand,  241 

Gelfand  transform,  237 

Gelfand-Naimark  theorem,  37 

generalized  functions,  32 

geometric  mean,  5 

geometric  means,  7 

GHz,  298, 301,319 

Golay,  325,  326 

Golay  sequences,  226 

Goldston,  290, 292 

Golomb,  323,  327 

Gram-Schmidt  orthonormalization,  129 
Gu,  Q.,  359 

Holder  exponent,  136-138 

Haagerup,  222 

Haar  basis,  129, 130 

Haar  measure,  260,  334,  335,  338,  340 

Haar  system,  105, 106, 109, 120, 122,  124 

Haar  wavelet,  329,  360 

Haar,  A.,  221 

Hadamard  Circulant  Conjecture,  219 
Hadamard  difference  sets,  227 


Hadamard  factorisation  theorem,  3 1 3 

Hadamard  lacunary  condition,  65 

Hadamard-Menon  difference  sets,  227 

Hadamard-Paley  cyclic  difference  set,  221 

Hadamard-Paley  difference  sets,  227 

Haldsz,  283 

Halmos,  39-41,46,  55 

Halton,  283,  292 

Hankel  expansions,  393,  398 

Hankel  functions,  388,  389,  392,  396,  397 

hard  summation,  121 

Hardy,  310,  316 

Hardy  class,  6,  19,  28 

Hardy  spaces,  32, 41 

Hardy-Littlewood  maximal  operator,  124 
Hardy-Littlewood  unimodular  polynomials,  230 
Hardy- Weierstrass  function,  60 
Harmonic  sliding  analysis,  375 
Hausdorff  dimension,  60, 68 
Hausdorff  dimensions,  127,  128 
Hayes,  189,  191, 193-195, 198,  199 
Heckman,  145,  149 
Heisenberg,  174 

Heisenberg  group,  173,  303,  305-308,  31 1,  312 

Heisenberg  inequality,  3 

Heisenbery  related,  313,  314 

Helson  sets,  67, 68 

Helson,H.,  241 

Herglotz-F.  Riesz-Toeplitz  theorem,  37 
Hermite  functions,  308-310 
HF,  295 

Hilbert  basis,  304 

Hilbert  space,  31-33,  35,  36.  38-40, 43-46,  50,  52, 
53 

Hilbert  transform,  14,  26,  184-186, 192,  301 
Hilbert’s  Inequality,  286-288 
histogram,  127, 128, 133, 134, 138-140, 142 
Holder  singularities,  127 
homogeneous  Galilei  group,  338 
horodiscs,  250 
Hru§Cev  theorem,  24,  26 
HSA,  375 

Huffman  sequences,  219 
hull-kernel  topology,  377 
hypergeometric,  387,  389-391,  397,  398, 400 


Index 


407 


hypergeometric  series,  145 

I  and  Q  channels,  298, 300, 301 
IF,  297 

Image  enhancement,  168 

impulse  response,  38 1 

induced  representations,  307 

inhomogeneous  Galilei  group,  338 

inner  function,  41 

intensities,  163, 164 

interpolation  formula,  217 

intertwining,  306-308 

intrinseque,  134, 136, 140 

invariant  subspace  theorem,  33 

invariant  subspaces,  32, 40, 43, 53-55 

invisible  spectrum,  235 

irreducible  representations,  303,  305, 312 

isometric  lifting,  45 

Ivashev-Musatov  Theorem,  15 

jamming,  296 
Jensen  inequality,  6,  7 
Joum^e,  J-L,  360 

Komer,  T.,  68 
Komer,  Thomas,  70 

Kahane  and  Katznelson,  theorem  of,  1 19 

Kahane,J-P,241 

Kahane,  J.-R,207 

Kargaev,21,23, 28 

Kargayev,  23,  25,  27 

Katzenelson,  198 

Katznelson,  58, 66, 71 

Katznelson,  Y.,  240 

Kaufman,  Robert,  70 

Kazhdan,  377,  378 

Khinchin-Ostrowski  theorem,  24 

Kislyakov,  23 

Klemes,  Ivo,  231 

Kolmogorov,  14,  104, 121,  122 

Koom  winder,  145 

Koosis,  21, 23, 28 

Kronecker  sets,  67 

Kronecker’s  Theorem,  274 

Kuratowski-Ulam  theorem,  59 


L^vy  flight,  68 
L6vy  processes,  68 
lacunary,  12, 13 
lacunary  Taylor  series,  62 
Lagrange  interpolation,  229 
Laguerre  function,  309 
Landau,  228 
Landau,  E.,  204 
Laplace  transform,  18, 19 
Laplace  transforms,  389, 394 
Large  Sieve,  285 

large  sieve,  272, 285-287, 292, 293 

LCAO  method,  386 

Lebesgue,  57, 58, 62, 65, 69, 72 

Lebesgue  measure,  4,  8, 9, 1 1, 26 

Leblanc,  N.,  241 

left  invariant,  303 

Legendre  symbol,  221 

Lemari^,  360 

Levin,  18, 23, 28 

Levinson,  21 

Liang,  Rufeng,  357 

lift,  43 

lifting,  33, 43, 44, 46, 52,  53, 55 

lifting  problem,  129 

Lim,  189-191, 193, 195, 198, 199 

line  bundle,  307,  308 

linear  chirp,  319 

Linnik,  285,292, 293 

Logan,  184, 186, 187, 190, 193-195, 198 

logarithmic  integral,  5,  8,  20, 22 

Logvinenko,  12,  23 

Lorentz,  G.G.,  205 

Lorenz  norm,  382 

Lovdsz,  221 

low  pass  filter  multiplier,  357 
low  pass  filters,  296 
lower  box  dimension,  134 

M.  Riesz  potential,  26 
Mobius,  7 
Macdonald,  145 
Mackey,  307 
Mackey  theory,  307 
Makarov,  N.,  24 


408 


Index 


Malliavin,  66, 69, 71 
Mandelbrojt  theorem,  15 
Mandelbrot,  68, 71, 72 
Marcus,  62,  65 
Markov’s  inequality,  202 
Markov,  A. A,  202 
matched  filter,  298,  302,  317,  325 
matrix-valued  functions,  3 1 
Maximal  Function,  170 
maximalfunction,  122, 123,  170 
maximum  likelihood,  301 
meager  sets,  59 
mean  ergodic  theorem,  33 
Menchoff  universality  property,  65 
Mendeleiev,  202 
Menon  difference  sets,  227 
Meyer,  360,  365,  366 
microlocal  critical  constant,  249 
microlocal  upper  bound,  249 
Minimal  Function,  171 
minimalfunction,  166, 168 
Minimal  Principle,  166 
minimal  principle,  166 
minimal  spectral  hull,  239 
Minimally  Supported  Frequency,  333 
model  operator,  252 

Modified  Bessel  Function,  165,  166, 169 

modulation,  298 

moment  problem,  18 

Moment  Theorem,  54 

moment  theorem,  31-33,  53 

momentum  space,  385 

monostatic,  295 

Morgan,  18 

Moyal’s  identity,  306 

MRA,  340,  350,  353-362, 366 

MRA  wavelet  multiplier,  357 

MSF  wavelets,  333,  355,  360 

multifractal  analysis,  127, 136 

multifractal  formalism,  128,  135,  137-141, 143 

multiresolution  analysis,  350 

multistatic,  295 

narrow  band,  300,  301,  303,  315,  316 
Nazarov,  13, 14,18,21,23,  29 


Nestoridis  universality  property,  64 
Newman,  D.J.,  230,  240 
Newton  convolution,  27 
Newton-Kantorovich  method,  23 
Neymann-Pearson  Lemma,  301 
nilpotent  semigroups,  260 
noise  process,  299 
non-unimodular  group,  316 
norm-controlled,  249 

normalized  structure  factors,  163-165, 168,  386 
Nyquist’s  theorem,  74 

Olevskii,  103, 108 

Opdam,  145, 148, 149 

Oppenheim,  189-191, 193,  195,  198,  199 

opposite  phase,  298 

optimal  detector,  301 

oscillation  spaces,  132-135 

oscillator,  296 

Ostrowski,  283, 292 

Oversampling,  364,  365 

oversampling,  339 

Paley,  61-63, 67,  223 
Paley-Wiener  theorem,  182,  313 
Paneah  Theorem,  1 1 
parameter-shift  method,  167 
peak  factor,  218 
Peetre  theorem,  24 
Perelomov,  74,  75,  100 
perfectly  flat,  218 
periodic  electron  density,  384 
periodization,  13 
permutation  matrix,  322,  323 
Perrin,  Jean,  61 
Petrov,  A.,  241 

phase  amplitudes,  388-390,  392,  396,  399 

phase  coded  pulses,  298 

phase  modulation,  298 

Phase  Problem,  163 

phase  problem,  163,  164,  166 

phase  relations,  384 

phased  array,  296 

Phragmen-Lindelof  theorem,  18 

Pichorides  S.  K.,  212 

Pichorides,  S.K.,  205 


Index 


409 


Pick-Nevanlinna  theorem,  33 
Piecewise  Zak  Transform,  176 
Pigno,  14 

Pisier,  62, 65,  66, 72 
Pisier  algebra,  65 
Pitt,  R.,  236 

Plancherel  theorem,  330, 331 
plus-charge,  14-16 
plus-function,  6-8, 16, 17,  20,  27 
PN  problem,  48 
Poggio,  184, 198 

Poincare  asymptotic  expansions,  395-397 

Poisson  measure,  5 

Poisson  summation  formula,  152,  273 

Poisson-Jensen  formula,  19,  20 

Pollard,  23 

Pollington,  283 

polynomial,  203 

PONS,  326,  327 

PONS  sequences,  226 

Porat,  196, 198,  199 

positive  definite,  31 1, 312 

positive  definiteness,  32 

potential  function,  385 

potential  theory,  6,  23,  24 

Poussin  C.  de  La  Vallee,  217 

Poussin,  De  la  Vallde,  203 

power  spectral  density,  299 

Power  Sums,  281 

power  sums,  281,  285 

prediction  theory,  42, 53,  55 

Prime  ifc-tuple  Conjecture,  290 

Prime  Number  Theorem,  275,  281 

Principle  of  Functional  Closure,  397 

probability  measure,  6 

property  (T),  377 

pseudo-random  codes,  317 

pseudofunctions,  65 

pseudomeasures,  65 

pulse  compression  waveforms,  320 

quadratic  non-residues,  221 
quadratic  residues,  223 
Quantum  Mechanics,  385 
quasi  all,  59-61,  64,  65 


quasi  everywhere,  59 
quasi  sure,  59,  61,  64 
quasi-affine  dual,  341 
quasi-affine  system,  340,  341 
quasianalyticity,  4,  6, 18 
Quincunx  wavelet,  361 

Renyi,  285 

radar  ambiguity  function,  302,  327, 328 

Radar  System,  295,  328 

radar  system,  295-297, 299, 301 

Rademacher  Taylor  series,  62,  63 

radially  symmetric,  309 

Rahman  Q.  L,  217 

Rajchman,  15 

Rajchman  measures,  66 

Random  Series,  71 

random  series,  62 

range  estimator,  301 

rearrangement,  103, 104,  121 

receiver,  295-297, 299,  301, 302,  316 

receiver  noise,  299 

reciprocal  lattice  vector,  165, 169 

reflection  principle,  107 

reflectivity  kernel,  302 

Reiter’s  condition,  377 

relatively  dense,  1 1 

Requicha,  185, 198, 199 

resolution  ellipse,  309 

resolution  factor,  309,  310 

resolvable,  309 

resolvent  complement,  249 

restricted  ambiguity  partners,  314 

return  signal,  299-302 

RF,296 

RF  mixers,  296 

Riemann,  57, 58,  72 

Riemann  Hypothesis,  275, 289, 290 

Riemann  Lebesgue  lemma,  123 

Riemann-Lebesgue  lemma,  256 

Riemann-Mellin  inversion  formula,  19 

Riesz  basis,  105, 110-113, 115, 117-119 

Riesz  constant,  105, 114 

Riesz,  F.,  205 

Riesz,  M.,  205 


410 


Index 


Riesz,  Marcel,  215 

Riesz-Dunford  calculus,  250 

Riesz-Dunford  functional  calculi,  241 

right  invariant,  303 

RISC,  151-153 

rolling  hump,  109, 116 

Ron-Shen  theory,  73 

Rotem,  185, 186, 198 

Roth,  283-286,292,293 

Rudin,W.,241 

Ryser,  219 

Rzeszotnik,  Z.,  342 

S-band,  300 
Salem,  15,28 
Salem  sets,  67 
Salem,  Raphael,  230 
sample  values,  381 
Sapogov,  23 

Sarason,  31, 48,  50-53, 55,  56 

SAS  tangent  equations,  169, 170 

scaling  function,  350,  353,  355,  357-360, 362,  364 

scaling  function  multiplier,  357 

Schaake-van  der  Corput  inequality,  208 

Schauder  basis,  129 

Schempp,  303,  309,  328 

Schmeisser,  G.,  217 

Schmidt,  283,  284, 293 

Schmidt,  B.,  230 

Schrodinger  representation,  305 

Schrodinger  equation,  385 

Schur,  206 

Schur’s  algorithm,  50 
Schur’s  Lemma,  304 
Schur,  I.,  48 
second  category,  59 
Selberg,  272, 273,287,293 
self-inversive,  212 
sembounded  support,  5 
semicharacter,  254 
semigroup,  254 
separable,  332, 355, 364 
separation  property,  254 
separation  rays,  391 
Sereda,  12 


series,  divergence  of,  103 
series,  divergence  of,  125 
series,  rearrangement  of,  103-125 
set  of  uniqueness,  68 
Shake-and-Bake  algorithm,  163 
Shamai,  194, 195 

Shannon  wavelet,  329,  333,  359,  360,  362 

Shapiro,  325 

Shapiro  sequences,  226 

Shapiro,  H.S.,  230, 240 

Shapiro,  J.,  14 

shift  operator,  38, 40,  53 

shift-invariant  systems,  73, 76,  77, 79,  80,  83,  93 

Shimorin,  S.,  39, 43 

Shitz,  190, 191, 199 

short-time  Fourier  transform,  74 

sidelobes,  296,  319,  327 

Sidon  set,  66, 67,  70 

signal-to-noise  ratio,  296,  301 

simple  spectrum,  35,  36 

Single  Wavelength  Anomalous  Scattering,  168 

single  wavelength  anomalous  scattering,  163 

Slepian-Pollack  inequality,  1 1 

Smith,  14 

Sobolev  space,  25 

Sobolev  spaces,  129, 131, 135,  263 

Soundararajan,  290,  292 

spacious,  23,  24 

SPARSE,  12 

sparse  representations,  131 
sparse  support,  5,  23,  24 
spectra  of  singularities,  127 
spectral  gap,  23 
spectral  hulls,  235 
spectral  magnitude,  369 
spectral  sets,  45 
spectral  synthesis,  69 
spectral  theorem,  32,  34,  35,  37 
spectrally  closed,  253 

spectrum,  4, 6, 10, 14, 15, 17-20, 296,  300,  315 
spectrum  of  singulgularities,  128, 135-137, 141 
Speegle,  D.,  360 

square  integrable  representation,  306 
Sreider  measures,  240 
Sreider,  Yu.,  236 


Index 


411 


Stafney,  J.,  241 

Steinhaus,  58,  62, 72 

stepped  frequency  radars,  322 

Stokes  Phenomenon,  387,  393 

Stokes  Structure,  387, 390,  391, 393-396, 400 

Stone’s  theorem,  31 

Stone-von  Neumann  theorem,  305 

Stromberg,  130 

strong  a-pair,  9 

strong  asymptotic  expansions,  395,  397 

Strong  Pair  Correlation  Conjecture,  289, 290 

strongly  admissible,  141 

Structure  Invariants,  165 

structure  invariants,  163, 165, 167, 170 

sub-representations,  304 

subharmonic,  17 

subnormal  operators,  32 

symmetrised  form,  308 

synthetic  aperture  radar,  320 

Szego’s  inequality,  208 

tangent  formula,  163 
Tao,  103 
Target,  327 

target,  295, 296, 298-302, 305, 309, 315-317, 319, 
325 

Taylor  series,  61-65, 67,  394, 396, 400 
Tchebyshev,  203 
tempered  distribution,  7 
tensor  orthonormal  basis,  311 
theorem  of  Kahane  and  Katznelson,  119 
thin  sets,  57, 65,  68, 70, 72 
thumbtack,  316,  317, 325 

tight  frame,  178, 332,  339, 340,  343-345, 349, 364 
time  dilation,  300 
time  duration,  309 

time-frequency  localization,  375, 376 
time-invariant  linear  system,  381 
Toeplitz  matrix,  34 
Toeplitz  operator,  50, 51,  53,  54 
Tolimieri,  303, 305, 328 
Tolimieri-Orr-Janssen  representation,  76 
totalization,  58 
transfer  functions,  382 
translation  operator,  329, 346 


translation  subgroup,  335 
transmitter,  295, 296,  299 
Triebel-Lizorkim  spaces,  362 
triplets,  165, 166 
Tur^n,  272 

TXirln’s  First  Main  Theorem,  281 
Turan,  14 
Turyn,  R.,  230 
TWIN  algorithm,  385 

uncertainty  inequality,  74 

Uncertainty  Principle,  3,  386 

uniform  spectral  radius,  241 

uniformly  distributed,  272,  274, 276, 277 

unilateral  decay,  21-24 

unit  cell,  164-166, 168 

unital  Banach  algebra,  243 

unitarily  equivalent,  35, 36,  38-40, 45, 46 

Unitary  Dilation,  56 

unitary  dilation,  31, 33, 44, 45 

unitary  operator,  304 

Unitary  Representation,  304 

unitary  representation,  303,  304, 306,  307 

Urieli,  196, 199 

Vaaler,  274,  287,  292, 293 
van  der  Corput,  276-279 
van  der  Corput  set,  277 
Van  Hove,  189, 199 
Van-der-Corput  lemmas,  15 
Vaughan,  287, 291, 292 
VHF,  295 

Vinogradov,  S.,  241 

Voelecker,  185, 190, 194, 199 

Volberg  Theorem,  22 

Volterra  operator,  32 

von  Mangoldt’s  lambda  function,  275 

von  Neumann,  31,  33,  37, 45, 74, 76,  99 

von  Neumann’s  inequality,  38, 44 

Wagner,  284,  293 
Walnut’s  representation,  76 
wandering  space,  45 
Watson’s  Theorem,  395 
wave  functions,  384 
Waveform,  298, 316,  323 


412 


Index 


waveform,  296,  298-300,  302,  309,  310,  312,  314, 
316-323,325,326 

wavelet  coefficients,  127,  128,  130-137,  139,  140, 
142-144 

Wavelet  expansions,  127 
wavelet  expansions,  127,  144 
wavelet  multiplier,  357-359 
wavelet  sets,  333 
Wavelet  Transform,  365 
wavelet  transform,  334,  335 
wavelets,  103-125 

Weak  Pair  Correlation  Conjecture,  289 
Weil-Brezin  formula,  308 
Wermer,  J.,  262 

Wexler-Raz  biorthogonality  condition,  76 
Weyl  operational  calculus,  174 
Weyl  sum,  276, 278 
Weyl’s  Criterion,  272,  274,  277 
Weyl,  H.,271 

Weyl-Heisenberg  frames,  175 

Weyl-Heisenberg  group,  74 

Weyl-Heisenberg  transform,  363 

wide  band  cross  ambiguity  function,  315 

wide-band  ambiguity  function,  316 

wide-sense  stationary  random  sequences,  41 

Wiener,  58,  61,  62,  65 

Wiener  algebra,  58,  239 

Wiener  theorem,  236 

Wiener,  N.,  236 

Wiener-L6vy  theorem,  241 

Wiener-Pitt  phenomenon,  236 

Wiener-Pitt-Sreider  phenomenon,  235 

Wigner  distribution,  175 

Wigner-Ville  distribution,  74 

Wilcox,  303,  308-311,327 

Wilson  systems,  76 

windowed  Fourier  transforms,  173 

Wold  decomposition,  38, 42,  45 

Wolff,  27-29 

Woodward,  315 

worst-case  identification,  381 

Wutam  Consortium,  357 

Zak  Transform,  173 

Zak  transform,  73-76, 90-93, 98, 100, 101,  308 


Zeevi,  175, 177-180, 185, 186, 190, 191,  194-196, 
198, 199 
Zhang,  L.,  362 
Zibulski,  175, 177-180, 199 
Zibulski-Zeevi  description,  76 
Zygmund,  61,  62,  67,  72,  215 
Zygmund  sets,  66, 67 


